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Preface 


For many years the Landolt-Bórnstein—Group I Elementary Particles, Nuclei and 
Atoms: Vol. 21A (Physics and Methods Theory and Experiments, 2008), Vol. 21B1 
(Elementary Particles Detectors for Particles and Radiation. Part 1: Principles 
and Methods, 2011), Vol. 21B2 (Elementary Particles Detectors for Particles and 
Radiation. Part 2: Systems and Applications), and Vol. 21C (Elementary Particles 
Accelerators and Colliders, 2013) has served as a major reference work in the field 
of high-energy physics. 

When, not long after the publication of the last volume, open access (OA) 
became a reality for HEP journals in 2014, discussions between Springer and CERN 
intensified to find a solution for the “Lab6” which would make the content available 
in the same spirit to readers worldwide. This was helped by the fact that many 
researchers in the field expressed similar views and their readiness to contribute. 

Eventually, in 2016, on the initiative of Springer, CERN and the original Labó 
volume editors agreed in tackling the issue by proposing to the contributing authors 
a new OA edition of their work. From these discussions, a compromise emerged 
along the following lines: transfer as much as possible of the original material into 
open access; add some new material reflecting new developments and important 
discoveries, such as the Higgs boson; and adapt to the conditions due to the change 
from copyright to a CC BY 4.0 license. 

Some authors were no longer available for making such changes, having either 
retired or, in some cases, deceased. In most such cases, it was possible to find 
colleagues willing to take care of the necessary revisions. A few manuscripts could 
not be updated and are therefore not included in this edition. 

We consider that this new edition essentially fulfills the main goal that motivated 
us in the first place—there are some gaps compared to the original edition, as 
explained, as there are some entirely new contributions. Many contributions have 
been only minimally revised in order to make the original status of the field available 
as historical testimony. Others are in the form of the original contribution being 
supplemented with a detailed appendix relating to recent developments in the field. 
However, a substantial fraction of contributions has been thoroughly revisited by 
their authors resulting in true new editions of their original material. 


vi Preface 


We would like to express our appreciation and gratitude to the contributing 
authors, to the colleagues at CERN involved in the project, and to the publisher, 
who has helped making this very special endeavor possible. 


Vienna, Austria Christian Fabjan 
Geneva, Switzerland Stephen Myers 
Geneva, Switzerland Herwig Schopper 
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Chapter 1 ff) 
Introduction Geek for 


Herwig Schopper 


Since old ages it has been one of the noble aspirations of humankind to understand 
the world in which we are living. In addition to our immediate environment, planet 
earth, two more remote frontiers have attracted interest: the infinitely small and 
the infinitely large. A flood of new experimental and theoretical results obtained 
during the past decades has provided a completely new picture of the micro- and 
macrocosm and surprisingly intimate relations have been discovered between the 
two. It turned out that the understanding of elementary particles and the forces 
acting between them is extremely relevant for our perception of the cosmological 
development. Quite often scientific research is supported because it is the basis 
for technical progress and for the material well-being of humans. The exploration 
of the microcosm and the universe contributes to this goal only indirectly by the 
development of better instruments and new techniques. However, it tries to answer 
some fundamental questions which are essential to understand the origins, the 
environment and the conditions for the existence of humankind and thus is an 
essential part of the cultural heritage. 

One of the fundamental questions concerns the nature of matter, the substance 
of which the stars, the planets and living creatures are made, or to put it in another 
way—can the many phenomena which we observe in nature be explained on the 
basis of a few elementary building blocks and forces which act between them. The 
first attempts go back 2000 years when the Greek philosophers speculated about 
indestructible atoms, like Democritus, or the four elements and the regular bodies 
of Plato. 


H. Schopper (54) 
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Since Newton who introduced infinitely hard smooth balls as constituents of 
matter! and who described gravitation as the first force acting between them, the 
concept of understanding nature in terms of 'eternal building blocks hold together 
by forces has not changed during the past 200 years. What has changed was the 
nature of the elementary building blocks and new forces were discovered. The 
chemists discovered the atoms of the 92 elements which, however, contrary to their 
name, were found to be divisible consisting of a nucleus surrounded by an electron 
cloud. Then it was found that the atomic nuclei contain protons and neutrons. 
Around 1930 the world appeared simple with everything consisting of these three 
particles: protons, neutrons and electrons. 

Then came the ‘annus mirabilis’ 1931 with the discovery of the positron as the 
first representative of antimatter and the mysterious neutrino in nuclear beta-decay 
indicating a new force, the weak interaction. In the following decades the 'particle 
zoo’ with all its newly discovered mesons, pions and ‘strange’ particles was leading 
to great confusion. Simplicity was restored when all these hundreds of ‘elementary 
*particles could be understood in terms of a new kind of elementary particles, the 
quarks and their antiquarks. The systematics of these particles is mainly determined 
by the strong nuclear force, well described today by the quantum chromodynamics 
QCD. Whether quarks and gluons (the binding particles of the strong interaction) 
exist only inside the atomic nuclei or whether a phase transition into a quark-gluon 
plasma is possible, is one the intriguing questions which still needs an answer. 

Impressive progress was made in another domain, in the understanding of the 
weak nuclear force responsible for radioactive beta-decay and the energy production 
in the sun. Three kinds of neutrinos (with their associated antiparticles) were 
found and recently it could be shown that the neutrinos are not massless as 
had been originally assumed. The mechanism of the weak interaction could be 
clarified to a large extent by the discovery of its carriers, the W- and Z-particles. 
All the experimental results obtained so far will be summarized in this volume 
and the beautiful theoretical developments will be presented. The climax is the 
establishment of the “Standard Model of Particle Physics’ SM which has been shown 
to be a renormalizable gauge theory mainly by the LEP precision experiments. 
The LEP experiments have also shown that there are only three families of quarks 
and leptons (electron, muon, tau-particle and associated neutrinos), a fact not yet 
understood. 

All the attempts to find experimental deviations from the SM have failed so far. 
However, the SM cannot be the final theory for the understanding of the microcosm. 
Its main fault is that it has too many arbitrary parameters (e.g. masses of the 
particles, values of the coupling constants of the forces, number of quark and lepton 
families) which have to be determined empirically by experiment. An underlying 
theory based on first principles is still missing and possible ways into the future will 
be discussed below. 


!Isaac Newton, Optics, Query 31, London 1718. 
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Returning to the ‘naive’ point of ultimate building blocks one might ask whether 
the quarks and leptons are fundamental indivisible particles or whether they have a 
substructure. Here we are running into a dilemma which was recognised already by 
the philosopher Immanuel Kant.? Either ultimate building blocks are mathematical 
points and cannot be divided, but then it is difficult to understand how they can have 
a mass and carry charges and spin. Alternatively, the building blocks might have 
spatial extension, but then it is hard to understand why they could not be divided into 
smaller parts. Whenever one meets such a paradox in science it is usually resolved 
by recognising that a wrong question was asked. 

Indeed the recent developments of particle physics indicate that the naive concept 
of ultimate building blocks of matter has to be abandoned. The smaller the *building 
blocks’ are, the higher energies are necessary to break them up. This is simply a 
consequence of the Heisenberg uncertainty principle of quantum mechanics. In the 
case of quarks their binding energies become so strong that any energy applied 
to break them apart is used to produce new quark-antiquark pairs.? The existence 
of antimatter implies also that matter does not have an 'eternal' existence. When 
matter meets antimatter the two annihilate by being converted into ‘pure’ energy and 
in the reverse mode matter can be produced from energy in the form of particle- 
antiparticle pairs. 

One of the most exciting development of physics or in science in general is a 
change of paradigms. Instead of using building blocks and forces acting between 
them, it was progressively recognised that symmetry principles are at the basis of 
our understanding of nature. It seems obvious that laws of nature should be invariant 
against certain transformations since ‘nature does not know’ how we observe it. 
When we make experiments we have to choose the origin of the coordinate system, 
its orientation in space and the moment in time when we start the observation. These 
choices are arbitrary and the laws deduced from the observations should not depend 
on them. It is known since a long time that the invariance of laws of nature against 
the continuous transformations, i.e. translations and rotations in space and time, 
give rise to the conservation of momentum, angular momentum and energy, the 
most basic laws of classical physics.? The mirror transitions (i.e. spatial reflection, 
particle-antiparticle exchange and time reversal) lead to the conservation of parity 
P, charge parity C and detailed balance rules in reactions, all of which are essential 
ingredients of quantum mechanics. 

The detection of complete parity violation in weak interactions in 1957 was one 
of the most surprising observations. Many eminent physicists, including Wolfgang 


?Immanuel Kant, Kritik der reinen Vernunft, 1781, see, e.g., Meiner Verlag, Hamburg 1998, or 
translation by N.K. Smith, London, MacMillan 1929. 

The binding energies are comparable to mc”, where m is the rest mass of a quark and c is the 
velocity of light. 

^When Pope Paul John II visited CERN and I explained to him that we can ‘create’ matter his 
response was: you can ‘produce’ matter, but ‘creation’ is my business. 

5Emmy Noether, Nachr. d. kónigl. Gesellschaft d. Wissenschaften zu Gottingen, 1918, page 235. 
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Pauli, thought that this symmetry could not be violated. Such a believe indeed goes 
back to Emanuel Kant? who claimed that certain ‘a priori’ concepts have to be valid 
so that we would be able to explore nature. Since it seemed obvious that nature does 
not know whether we observe it directly or through a mirror a violation of mirror 
symmetry seemed unacceptable. This phenomenon is still not understood, although 
the fact that also C conservation is completely violated and the combined symmetry 
PC seemed to hold has reduced somewhat the original surprise. The whole situation 
has become more complicated by the detection that PC is also violated, although 
very little. A deep understanding of the violation of these ‘classical’ symmetries is 
still missing. So far experiments show that the combined symmetry PCT still holds 
as is required by a very general theorem. 

In field theories another class of more abstract symmetries has become 
important—the gauge symmetries. As is well known from Maxwell's equations the 
electrodynamic fields are fully determined by the condition that gauge symmetry 
holds, which means that the electric and magnetic fields are independent against 
gauge transformations of their potentials. It was discovered that analogous gauge 
symmetries determine the fields of the strong and weak interactions in which case 
the (spontaneous) breaking of the symmetries plays a crucial role. 

In summary, we have abandoned the description of nature in terms of hard 
indestructible spheres in favour of abstract ideas—the symmetries and there break- 
ing. From a philosophical point of view one might, in an over-simplistic way, 
characterize the development as moving away from Democritus to Plato. 

Finally, it should be mentioned that in particle physics progress was only possible 
by an intimate cooperation between theory and experiments. The field has become 
so complex that by chance discoveries are extremely rare. The guidance by theory 
is necessary to be able to put reasonable questions to nature. This does not exclude 
great surprises since many theoretical predictions turned out to be wrong. Indeed 
most progress could be made by verifying or disproving theories. 

Although the Standard Model of Particle Physics SM (with some extensions, e.g. 
allowing for masses of neutrinos) has achieved a certain maturity by being able to 
reproduce all experimental results obtained so far, it leaves open many fundamental 
questions. One particular problem one has gotten accustomed to, concerns P and C 
violations which are put into the SM *by hand'. And as has been mentioned above 
the SM leaves open many other questions which indicate that it cannot be a final 
theory. 

In 2008 I wrote the concluding paragraph of this introduction as “Many 
arguments indicate that a breakthrough in the understanding of the microcosm will 
happen when the results of LHC at CERN will become available. LHC will start 
operation in 2008, but it will probably take several years before the experiments 
will have sufficient data and one will be able to analyse the complicated events 
before a major change of our picture will occur, although surprises are not excluded. 
Hence it seems to be an appropriate time to review the present situation of our 
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understanding of the microcosm". Meanwhile, more than 10 years later, and with the 
Higgs boson discovered in 2014 at the LHC, the extended SM has been confirmed 
with unprecedented precision yet the outstanding questions, in particular which path 
to follow beyond the SM, have remained with us. 


Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 
International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, 
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate 
credit to the original author(s) and the source, provide a link to the Creative Commons licence and 
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Gauge Theories and the Standard Model gee 


Guido Altarelli and Stefano Forte 


2.1 Introduction to Chaps. 2, 3 and 4 
Stefano Forte 


The presentation of the Standard Model in Chap. 2, Chaps. 3 and 4 was originally 
written by Guido Altarelli in 2007. In this introduction we provide a brief update 
(with references), and a discussion of the main developments which have taken place 
since the time of the writing. 

Chapter 2 presents the architecture of the Standard Model, the way symmetries 
are realized and the way this can is described at the quantum level. The structure 
of the Standard Model is now well-established since half a century or so. The 
presentation in this chapter highlights the experimental (and thus, to a certain 
extent, historical Chap. 2) origin of the main structural aspects of the theory. The 
only aspects of the presentation which require (minimal) updating are the numerical 
values given for parameters, such as the Fermi coupling constant G r, see Eq. (2.3). 
All of these parameters have been known quite accurately since the early 2000s 
(with the exception of neutrino masses, see Sect. 3.7 of Chap. 3), and thus their 
values are quite stable. The numbers given below are taken from the then-current 
edition of the Particle Data Book (PDG) [7]. At any given time, in order to have the 
most recent and accurate values, the reader should consult the most recent edition 


The author *G. Altarelli" is deceased at the time of publication. 
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of the PDG [25], preferably using the web-based version [26], which is constantly 
updated. 

Chapter 3 presents the Electroweak sector of the Standard Model, which was 
established as a successful theory by extensive experimentation at the LEP electron- 
positron collider of CERN in the last decade of the past century, including some 
aspects of the theory, such as the CKM mechanism for mass mixing (see Sect. 3.6) 
which were originally often considered to be only approximate. The discovery, at 
the turn of the century, of neutrino mixing, and thus non-vanishing neutrino masses 
(see Sect. 3.7) has been the only significant addition to the minimal version of the 
electroweak theory as formulated in the sixties and seventies of the past century. 
The general understanding of electroweak interactions was thus essentially settled 
at the time of the writing of this chapter. 

From the experimental point of view, the main development since then is the 
successful completion of the first two runs of the LHC, which have provided further 
confirmation of the standard Electroweak theory (see Ref. [27] for a recent review). 
From a theoretical point of view, the main surprise (from the LHC, but also a number 
of other experiments) is that there have been no surprises. 

First and foremost, the Higgs sector of the Standard Model: after discovery of 
the Higgs boson in 2012 [28, 29] the Higgs sector has turned out so far to be 
in agreement with the minimal one-doublet structure presented in Sect. 3.5. The 
discussion presented there, as well as the phenomenology of the Standard Model 
Higgs of Sect. 3.13, remain thus essentially unchanged by the Higgs discovery. A 
theoretical introduction with more specific reference to the LHC can be found in 
Ref. [30], while the current experimental status of Higgs properties can be found in 
the continually updated pages of the CERN Higgs cross-section working group [31]. 
Perhaps, the only real surprise in the Higgs sector of the Standard Model is the 
extreme closeness of the measured Higgs mass to the critical value required for 
vacuum stability (see Sect. 3.13.1 below)—a fact with interesting cosmological 
implications [32]. The discovery of the Higgs has changed somewhat the nature of 
global fits of Standard Model parameters discussed in Sect. 3.12: with the value of 
the Higgs mass known, the fit is over-constrained—though the conclusion of global 
consistency remains unchanged. An updated discussion is given in Ref. [27], as well 
as in the review on the Electroweak Model by Erler and Freitas in the PDG [26]. 

Besides Higgs discovery, the general trend of the last several years has been that 
of the gradual disappearance of all anomalies—instances of discrepancy between 
Standard Model predictions and the data—either due to more accurate theory 
calculations (or even the correction of errors: see Sect. 3.9), or to more precise 
measurements. A case in point is that of the measurements of the electroweak 
mixing angle, discussed in Sect. 3.12: the tensions or signals of disagreement 
which are discussed there have all but disappeared, mostly thanks to more accurate 
theoretical calculations. Another case in which the agreement between Standard 
Model and experiment is improving (albeit perhaps more slowly) is that of lepton 
anomalous magnetic moments, discussed in Sect. 3.9. In both cases, updates on the 
current situation can again been found in Ref. [27], and in the aforementioned PDG 
review by Erler and Freitas. 
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Finally, there is a number of cases in which data from LHC experiments (as 
well as other experiments, specifically in the fields of flavor physics and neutrino 
physics) have brought more accuracy and more stringent tests, without changing the 
overall picture. These include gauge boson couplings, discussed in Sects. 3.3-3.4, 
for which we refer to Ref. [27]; the CKM matrix and flavor physics, discussed in 
Sect. 3.6, for which we refer to the review by Ceccucci, Ligeti and Sakai in the 
PDG [26]; neutrino masses and mixings, discussed in Sect. 3.7, for which we refer 
to the PDG review by Gonzalez-Garcia and Yokohama [26]. 

This perhaps unexpected success of the Standard Model, and the failure to find 
any evidence so far of new physics (and in particular supersymmetry) at the LHC 
has somewhat modified the perspective on the limitations of the Standard Model 
discussed in Sect. 3.14. Specifically, the significance of the hierarchy problem—the 
so-called “naturalness” issue—must be questioned, given that it entails new physics 
which has not be found: a suggestive discussion of this shift in perspective is in 
Ref. [33]. Yet, the classification of possible new physics scenarios of Sect. 3.14 
remains essentially valid: recent updates are in Ref. [34] for supersymmetric models, 
and in Ref. [35] for non-supersymmetric ones. Consequently, looking for new 
physics has now become a precision exercise, and this has provided a formidable 
stimulus to the study of Electroweak radiative corrections, which has been the 
subject of very intense activity beyond the classic results discussed in Sect. 3.10: 
a recent detailed review is in Ref. [36]. 

Chapter 4 is devoted to the theory of strong interactions, Quantum Chromody- 
namics (QCD). This theory has not changed since its original formulation in the 
second half of the past century. Specifically, its application to hard processes, which 
allows for the use of perturbative methods, is firmly rooted in the set of classic 
results and techniques discussed in Sect. 4.5 below. What did slowly change over 
the years is the experimental status of QCD. What used to be, in the past century, a 
theory established qualitatively, has gradually turned into a theory firmly established 
experimentally—though, at the time this chapter was written, not quite tested to the 
same precision as the electroweak theory (see Sect. 4.7). Now, after the first two 
runs of the LHC, it can be stated that the whole of the Standard Model, QCD and the 
Electroweak theory, are tested to the same very high level of accuracy and precision, 
typically at the percent or sub-percent level. 

Turning QCD into a precision theory has been a pre-requisite for successful 
physics at the LHC, a hadron collider in which every physical process necessarily 
involves the strong interaction, since the colliding objects are protons (or nuclei). 
This has grown into a pressing need as the lack of discovery of new particles or 
major deviations from Standard Model predictions has turned the search for new 
physics signals into a precision exercise: it has turned the LHC from an "energy 
frontier" to a “rarity/accuracy frontier" machine—something that was deemed 
inconceivable just before the start of its operation [37]. 

This rapid progress has happened thanks to an ever-increasing set of computa- 
tional techniques, which, building upon the classic results presented in this chapter, 
has allowed for an enormous expansion of the set of perturbative computations of 
processes at colliders which are introduced in Sect. 4.5.4, and discussed in more 
detail in the context of LHC (and specifically Higgs) physics in Ref. [30]. 
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To begin with, basic quantities such as the running of the coupling, discussed in 
Sect. 4.4, and R,+,-, discussed in Sect. 4.5.1 are now know to one extra perturbative 
order (see the QCD review of the PDG [26] for the current state of the art and 
full references). These are five-loop perturbative calculations, now made possible 
thanks to the availability of powerful computing resources. Furthermore, the set 
of processes discussed in Sect. 4.5.4 has now been extended to include essentially 
all relevant hadron collider processes, which have been routinely computed to 
third perturbative order, while the first fourth-order calculations have just started 
appearing. Again, the QCD review of the PDG [26] provides a useful status update, 
including comparison between computation and experiment, which refer to cross- 
sections which span about ten orders of magnitude in size. 

This progress has been happening thanks to the development of a vast new 
set of computational techniques, which, rooted in perturbative QCD, have now 
spawned a dedicated research field: that of amplitudes [38], which relates phe- 
nomenology, quantum field theory, and mathematics. The classic set of methods for 
"resummation"—the sum of infinite classes of perturbative contributions, discussed 
specifically in Sect. 4.5.3.1 for deep-inelastic scattering, has been extended well 
beyond the processes and accuracy discussed in Sect. 4.5.4—an up-to-date list is 
in the QCD review of the PDG [26]. Moreover, an entirely new set of resummation 
techniques has been developed, using the methodology of effective field theories: the 
so-called soft-collinear effective theory (SCET) which provides an extra tool in the 
resummation box [39]. One remarkable consequence of all these developments is 
that it is now possible to understand in detail the structure of pure strong interaction 
events, in which jets of hadrons are produced in the final state, by looking inside 
these events and tracing their structure in terms of the fundamental fields of QCD— 
quarks and gluons [40]. 

One topic in which things have changed rather less is the determination of the 
strong coupling, discussed in Sect. 4.7. Whereas the agreement between predicted 
and observed scaling violations discussed in Sect. 4.6.3 is ever more impressive (see 
the review on structure functions of the PDG [26]) the accuracy on the determination 
of the strong coupling itself has not improved much. Updated discussions can be 
found in the QCD review of the PDG, as well as in Ref. [41]. Progress is likely 
to come from future, more accurate LHC data, as well as from non-perturbative 
calculations [42] (not discussed here) soon expected to become competitive. 

All in all, the dozen or so years since the original writing of these chapter have 
seen a full vindication of the Standard Model as a correct and accurate theory, and 
have stimulated a vast number of highly sophisticated experimental and theoretical 
results which build upon the treatment presented below. 


2.2 Introduction 


The ultimate goal of fundamental physics is to reduce all natural phenomena to a set 
of basic laws and theories that, at least in principle, can quantitatively reproduce and 
predict the experimental observations. At microscopic level all the phenomenology 
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of matter and radiation, including molecular, atomic, nuclear and subnuclear 
physics, can be understood in terms of three classes of fundamental interactions: 
strong, electromagnetic and weak interactions. In atoms the electrons are bound to 
nuclei by electromagnetic forces and the properties of electron clouds explain the 
complex phenomenology of atoms and molecules. Light is a particular vibration 
of electric and magnetic fields (an electromagnetic wave). Strong interactions bind 
the protons and neutrons together in nuclei, being so intensively attractive at short 
distances that they prevail over the electric repulsion due to the equal sign charges of 
protons. Protons and neutrons, in turn, are composites of three quarks held together 
by strong interactions to which quarks and gluons are subject (hence these particles 
are called “hadrons” from the Greek word for "strong"). To the weak interactions 
are due the beta radioactivity that makes some nuclei unstable as well as the nuclear 
reactions that produce the enormous energy radiated by the stars and by our Sun 
in particular. The weak interactions also cause the disintegration of the neutron, the 
charged pions, the lightest hadronic particles with strangeness, charm, and beauty 
(which are "flavour" quantum numbers) as well as the decay of the quark top and of 
the heavy charged leptons (the muon jz~ and the tau t~). In addition all observed 
neutrino interactions are due to weak forces. 

All these interactions are described within the framework of quantum mechanics 
and relativity, more precisely by a local relativistic quantum field theory. To each 
particle, described as pointlike, is associated a field with suitable (depending on 
the particle spin) transformation properties under the Lorentz group (the relativistic 
space-time coordinate transformations). It is remarkable that the description of all 
these particle interactions is based on a common principle: "gauge" invariance. A 
"gauge" symmetry is invariance under transformations that rotate the basic internal 
degrees of freedom but with rotation angles that depend on the space-time point. 
At the classical level gauge invariance is a property of the Maxwell equations of 
electrodynamics and it is in this context that the notion and the name of gauge 
invariance were introduced. The prototype of all quantum gauge field theories, 
with a single gauged charge, is QED, Quantum Electro-Dynamics, developed in 
the years from 1926 until about 1950, which indeed is the quantum version of 
Maxwell theory. Theories with gauge symmetry, at the renormalizable level, are 
completely determined given the symmetry group and the representations of the 
interacting fields. The whole set of strong, electromagnetic and weak interactions 
is described by a gauge theory, with 12 gauged non-commuting charges, which is 
called “the Standard Model” of particle interactions (SM). Actually only a subgroup 
of the SM symmetry is directly reflected in the spectrum of physical states. A part of 
the electroweak symmetry is hidden by the Higgs mechanism for the spontaneous 
symmetry breaking of a gauge symmetry. 

For all material bodies on the Earth and in all geological, astrophysical and cos- 
mological phenomena a fourth interaction, the gravitational force, plays a dominant 
role, while it is instead negligible in atomic and nuclear physics. The theory of 
general relativity is a classic (in the sense of non quantum mechanical) description 
of gravitation that goes beyond the static approximation described by Newton law 
and includes dynamical phenomena like, for example, gravitational waves. The 
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problem of the formulation of a quantum theory of gravitational interactions is 
one of the central problems of contemporary theoretical physics. But quantum 
effects in gravity become only important for energy concentrations in space-time 
which are not in practice accessible to experimentation in the laboratory. Thus the 
search for the correct theory can only be done by a purely speculative approach. 
All attempts at a description of quantum gravity in terms of a well defined and 
computable local field theory along similar lines as for the SM have so far failed to 
lead to a satisfactory framework. Rather, at present the most complete and plausible 
description of quantum gravity is a theory formulated in terms of non pointlike 
basic objects, the so called "strings", extended over distances much shorter than 
those experimentally accessible, that live in a space-time with 10 or 11 dimensions. 
The additional dimensions beyond the familiar 4 are, typically, compactified which 
means that they are curled up with a curvature radius of the order of the string 
dimensions. Present string theory is an all-comprehensive framework that suggests 
a unified description of all interactions together with gravity of which the SM would 
be only a low energy or large distance approximation. 

A fundamental principle of quantum mechanics, the Heisenberg indetermination 
principle, implies that, for studying particles with spatial dimensions of order Ax or 
interactions taking place at distances of order Ax, one needs as a probe a beam of 
particles (typically produced by an accelerator) with impulse p Z, h/Ax, where fi 
is the reduced Planck constant (A = h/2zr). Accelerators presently in operation or 
available in the near future, like the Large Hadron Collider at CERN near Geneva, 
allow to study collisions between two particles with total center of mass energy up 
to 2E ~ 2pc S 14 TeV. These machines, in principle, can allow to study physics 
down to distances Ax > 10715 cm. Thus, on the basis of results from experiments 
at existing accelerators, we can confirm that, down to distances of that order of 
magnitude, indeed electrons, quarks and all the fundamental SM particles do not 
show an appreciable internal structure and look elementary and pointlike. We expect 
that quantum effects in gravity will certainly become important at distances Ax = 
107? cm corresponding to energies up to E ~ Mpic? ~ 10? GeV, where Mp; 
is the Planck mass, related to Newton constant by Gy = fic/M ae At such short 
distances the particles that so far appeared as pointlike could well reveal an extended 
structure, like for strings, and be described by a more detailed theoretical framework 
of which the local quantum field theory description of the SM would be just a low 
energy/large distance limit. 

From the first few moments of the Universe, after the Big Bang, the temperature 
of the cosmic background went down gradually, starting from kT ^ Mp;c?, where 
k = 8.617 . .. 1075 eV K7! is the Boltzmann constant, down to the present situation 
where T ^ 2.725 K. Then all stages of high energy physics from string theory, 
which is a purely speculative framework, down to the SM phenomenology, which is 
directly accessible to experiment and well tested, are essential for the reconstruction 
of the evolution of the Universe starting from the Big Bang. This is the basis for the 
ever increasing relation between high energy physics and cosmology. 
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2.3 Overview of the Standard Model 


The SM is a gauge field theory based on the symmetry group SU (3)6 SU (2) GU (1). 
The transformations of the group act on the basic fields. This group has 8+3+1= 
12 generators with a non trivial commutator algebra (if all generators commute 
the gauge theory is said to be “abelian”, while the SM is a “non abelian” gauge 
theory). SU (3) is the “colour” group of the theory of strong interactions (QCD: 
Quantum Chromo-Dynamics [1—3]). SU (2) & U (1) describes the electroweak (EW) 
interactions [4—6] and the electric charge Q, the generator of the QED gauge group 
U (1)g, is the sum of 73, one of the SU (2) generators and of Y/2, where Y is the 
U (1) generator: Q = 73 + Y/2. 

In a gauge theory to each generator T is associated a vector boson (also said 
gauge boson) with the same quantum numbers as 7, and, if the gauge symmetry is 
unbroken, this boson is of vanishing mass. These vector (i.e. of spin 1) bosons act as 
mediators of the corresponding interactions. For example, in QED the vector boson 
associated to the generator Q is the photon y. The interaction between two charged 
particles in QED, for example two electrons, is mediated by the exchange of one 
(or seldom more than one) photon emitted by one electron and reabsorbed by the 
other one. Similarly in the SM there are 8 massless gluons associated to the SU (3) 
colour generators, while for SU (2) & U (1) there are 4 gauge bosons Wt*,W-,z? 
and y. Of these, only the photon y is massless because the symmetry induced by 
the other 3 generators is actually spontaneously broken. The masses of W+, W^ 
and Z? are quite large indeed on the scale of elementary particles: mw ~ 80.4 GeV, 
mz ~ 91.2 GeV are as heavy as atoms of intermediate size like rubidium and 
molibdenum, respectively. In the electroweak theory the breaking of the symmetry is 
of a particular type, denoted as spontaneous symmetry breaking. In this case charges 
and currents are as dictated by the symmetry but the fundamental state of minimum 
energy, the vacuum, is not unique and there is a continuum of degenerate states 
that all together respect the symmetry (in the sense that the whole vacuum orbit is 
spanned by applying the symmetry transformations). The symmetry breaking is due 
to the fact that the system (with infinite volume and infinite number of degrees of 
freedom) is found in one particular vacuum state, and this choice, which for the SM 
occurred in the first instants of the Universe life, makes the symmetry violated in 
the spectrum of states. In a gauge theory like the SM the spontaneous symmetry 
breaking is realized by the Higgs mechanism (described in detail in Sect. (2.7)): 
there are a number of scalar (i.e. of zero spin) Higgs bosons with a potential that 
produces an orbit of degenerate vacuum states. One or more of these scalar Higgs 
particles must necessarily be present in the spectrum of physical states with masses 
very close to the range so far explored. It is expected that the Higgs particle(s) will 
be found at the LHC thus completing the experimental verification of the SM. 

The fermionic (all of spin 1/2) matter fields of the SM are quarks and leptons. 
Each type of quark is a colour triplet (1.e. each quark flavour comes in three colours) 
and also carries electroweak charges, in particular electric charges 2/3 for up-type 
quarks and — 1/3 for down-type quarks. So quarks are subject to all SM interactions. 
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Leptons are colourless and thus do not interact strongly (they are not hadrons) but 
have electroweak charges, in particular electric charges — 1 for charged leptons (e7, 
u` and t~) while it is O for neutrinos (ve, v, and vr). Quarks and leptons are 
grouped in 3 "families" or "generations" with equal quantum numbers but different 
masses. At present we do not have an explanation for this triple repetition of fermion 


families: 
uU uu Ve CCC Vy ttt 
, 2.1 
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The QCD sector of the SM has a simple structure but a very rich dynamical 
content, including the observed complex spectroscopy with a large number of 
hadrons. The most prominent properties of QCD are asymptotic freedom and 
confinement. In field theory the effective coupling of a given interaction vertex is 
modified by the interaction. As a result, the measured intensity of the force depends 
on the transferred (four)momentum squared, Q?, among the participants. In QCD 
the relevant coupling parameter that appears in physical processes is a; = e2 [4n 
where e; is the coupling constant of the basic interaction vertices of quark and 
gluons: qqg or ggg (see Eq. (2.30)). Asymptotic freedom means that the effective 
coupling becomes a function of Q?: o,(Q?) decreases for increasing Q? and 
vanishes asymptotically. Thus, the QCD interaction becomes very weak in processes 
with large Q?, called hard processes or deep inelastic processes (i.e. with a final state 
distribution of momenta and a particle content very different than those in the initial 
state). One can prove that in 4 space-time dimensions all pure-gauge theories based 
on a non commuting group of symmetry are asymptotically free and conversely. 
The effective coupling decreases very slowly at large momenta with the inverse 
logarithm of Q?: æs (Q?) = 1/blog Q?/ A? where b is a known constant and A is 
an energy of order a few hundred MeV. Since in quantum mechanics large momenta 
imply short wavelengths, the result is that at short distances the potential between 
two colour charges is similar to the Coulomb potential, i.e. proportional to o, (r)/r, 
with an effective colour charge which is small at short distances. On the contrary the 
interaction strength becomes large at large distances or small transferred momenta, 
of order Q < A. In fact all observed hadrons are tightly bound composite states 
of quarks (baryons are made of qqq and mesons of qq), with compensating colour 
charges so that they are overall neutral in colour. In fact, the property of confinement 
is the impossibility of separating colour charges, like individual quarks and gluons 
or any other coloured state. This is because in QCD the interaction potential between 
colour charges increases at long distances linearly in r. When we try to separate the 
quark and the antiquark that form a colour neutral meson the interaction energy 
grows until pairs of quarks and antiquarks are created from the vacuum and new 
neutral mesons are coalesced and observed in the final state instead of free quarks. 
For example, consider the process e* e^ — qq at large center of mass energies. 
The final state quark and antiquark have large energies, so they separate in opposite 
directions very fast. But the colour confinement forces create new pairs in between 
them. What is observed is two back-to-back jets of colourless hadrons with a number 
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of slow pions that make the exact separation of the two jets impossible. In some 
cases a third well separated jet of hadrons is also observed: these events correspond 
to the radiation of an energetic gluon from the parent quark-antiquark pair. 

In the EW sector the SM inherits the phenomenological successes of the old 
(V — A) & (V — A) four-fermion low-energy description of weak interactions, and 
provides a well-defined and consistent theoretical framework including weak inter- 
actions and quantum electrodynamics in a unified picture. The weak interactions 
derive their name from their intensity. At low energy the strength of the effective 
four-fermion interaction of charged currents is determined by the Fermi coupling 
constant G r. For example, the effective interaction for muon decay is given by 


Lett = (Gr/V2) [Puya — ys)u] [ev* (1 — ys)ve] . (2.2) 
with [7] 
Gr = 1.16639(1) x 10? GeV? , (2.3) 


In natural units i = c = 1, Gr has dimensions of (mass) 2. As a result, the intensity 
of weak interactions at low energy is characterized by G p E?, where E is the energy 
scale for a given process (E ~ m,, for muon decay). Since 


Gr E? = Grm? (E/mp) = 10 9(E/mp? , (2.4) 


where m is the proton mass, the weak interactions are indeed weak at low energies 
(up to energies of order a few ten’s of GeV). Effective four fermion couplings for 
neutral current interactions have comparable intensity and energy behaviour. The 
quadratic increase with energy cannot continue for ever, because it would lead to a 
violation of unitarity. In fact, at large energies the propagator effects can no longer 
be neglected, and the current—current interaction is resolved into current-W gauge 
boson vertices connected by a W propagator. The strength of the weak interactions 
at high energies is then measured by gw, the W — —y-v, coupling, or, even better, 
by aw = Be, /47 analogous to the fine-structure constant œ of QED (in Chap. 3, 
gw is simply denoted by g or g2). In the standard EW theory, we have 


aw = V2 Gr my /n = 1/30. (2.5) 


That is, at high energies the weak interactions are no longer so weak. 

The range rw of weak interactions is very short: it is only with the experimental 
discovery of the W and Z gauge bosons that it could be demonstrated that rw is 
non-vanishing. Now we know that 


h 
ry = —— ~2.5 x 10716 cm, (2.6) 
mwc 
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corresponding to mw ~ 80.4 GeV. This very large value for the W (or the Z) 
mass makes a drastic difference, compared with the massless photon and the infinite 
range of the QED force. The direct experimental limit on the photon mass is [7] 
My < 6 10-7 eV. Thus, on the one hand, there is very good evidence that the 
photon is massless. On the other hand, the weak bosons are very heavy. A unified 
theory of EW interactions has to face this striking difference. 

Another apparent obstacle in the way of EW unification is the chiral structure 
of weak interactions: in the massless limit for fermions, only left-handed quarks 
and leptons (and right-handed antiquarks and antileptons) are coupled to W's. This 
clearly implies parity and charge-conjugation violation in weak interactions. 

The universality of weak interactions and the algebraic properties of the electro- 
magnetic and weak currents [the conservation of vector currents (CVC), the partial 
conservation of axial currents (PCAC), the algebra of currents, etc.] have been 
crucial in pointing to a symmetric role of electromagnetism and weak interactions 
at a more fundamental level. The old Cabibbo universality [8] for the weak charged 
current: 


JZ = Dy Yall — ys) + Veya(1 — ys)e + cos 0e yo (1 — ys)d + 


+ sin Oc üyg4(1l— ys)s +... , (2.7) 


suitably extended, is naturally implied by the standard EW theory. In this theory 
the weak gauge bosons couple to all particles with couplings that are proportional 
to their weak charges, in the same way as the photon couples to all particles in 
proportion to their electric charges [in Eq. (2.7), d’ = cos6, d + sin6, s is the 
weak-isospin partner of u in a doublet. The (u, d^) doublet has the same couplings 
as the (ve, £) and (vy, u) doublets]. 

Another crucial feature is that the charged weak interactions are the only known 
interactions that can change flavour: charged leptons into neutrinos or up-type 
quarks into down-type quarks. On the contrary, there are no flavour-changing neutral 
currents at tree level. This is a remarkable property of the weak neutral current, 
which is explained by the introduction of the Glashow-Iliopoulos-Maiani (GIM) 
mechanism [9] and has led to the successful prediction of charm. 

The natural suppression of flavour-changing neutral currents, the separate con- 
servation of e, u and t leptonic flavours that is only broken by the small neutrino 
masses, the mechanism of CP violation through the phase in the quark-mixing 
matrix [10], are all crucial features of the SM. Many examples of new physics tend 
to break the selection rules of the standard theory. Thus the experimental study of 
rare flavour-changing transitions is an important window on possible new physics. 

The SM is a renormalizable field theory which means that the ultra-violet 
divergences that appear in loop diagrams can be eliminated by a suitable redefinition 
of the parameters already appearing in the bare lagrangian: masses, couplings and 
field normalizations. As it will be discussed later, a necessary condition for a theory 
to be renormalizable is that only operator vertices of dimension not larger than 4 
(that is m^ where m is some mass scale) appear in the lagrangian density £ (itself 
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of dimension 4, because the action S is given by the integral of £ over d^x and 
is dimensionless in natural units: A = c = 1). Once this condition is added to 
the specification of a gauge group and of the matter field content the gauge theory 
lagrangian density is completely specified. We shall see the precise rules to write 
down the lagrangian of a gauge theory in the next Section. 


2.4 The Formalism of Gauge Theories 


In this Section we summarize the definition and the structure of a gauge Yang-Mills 
theory [11]. We will list here the general rules for constructing such a theory. Then 
these results will be applied to the SM. 

Consider a lagrangian density Z[$, 9,9] which is invariant under a D dimen- 
sional continuous group of transformations: 


óx)2U(059() (A=1,2,...,D). (2.8) 
with: 
U (6^) = explig $ 0^T^] — 1- ig; 0^T^ +..., (2.9) 
A A 


The quantities 0^ are numerical parameters, like angles in the particular case of a 
rotation group in some internal space. The approximate expression on the right is 
valid for 0^ infinitesimal. Then, g is the coupling constant and T ^ are the generators 
of the group I’ of transformations (2.8) in the (in general reducible) representation of 
the fields @. Here we restrict ourselves to the case of internal symmetries, so that T^ 
are matrices that are independent of the space-time coordinates and the arguments 
of the fields $ and ¢’ in Eq. (2.8) is the same. If U is unitary, then the generators T^ 
are Hermitian, but this need not be the case in general (though it is true for the SM). 
Similarly if U is a group of matrices with unit determinant, then the traces of T^ 
vanish: tr(T^) = 0. The generators T^ are normalized in such a way that for the 
lowest dimensional non-trivial representation of the group I (we use 1^ to denote 
the generators in this particular representation) we have 


1 
tr(t^t®) = 2 . (2.10) 


The generators satisfy the commutation relations 


[T^, TP] 2 iCAgc TC . (2.11) 
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For A, B, C ....upordown indices make no difference: T^ = T4 etc. The structure 
constants CApc are completely antisymmetric in their indices, as can be easily seen. 
In the following, for each quantity f^ we define 


f= x pg. (2.12) 
A 


For example, we can rewrite Eq. (2.9) in the form: 
U (0^) = exp[ig0] ~ 1+igd +..., (2.13) 


If we now make the parameters 0^ depend on the space-time coordinates 0^ = 
p^ (xu), LIM, In] is in general no longer invariant under the gauge transformations 
U[0^ (xy), because of the derivative terms: indeed 0,,¢' = 0,(U¢) # Udud. 
Gauge invariance is recovered if the ordinary derivative is replaced by the covariant 
derivative: 


Dy = 9, +igVy (2.14) 


where p are a set of D gauge vector fields (in one-to-one correspondence with the 
group generators) with the transformation law 


Vi, = UVU! — (1/ig)(@ U)UT! . (2.15) 


For constant 0^, V reduces to a tensor of the adjoint (or regular) representation of 
the group: 


Vom UV U! e Ve + 1818, Va] «s (2.16) 
which implies that 
VIC = VE — gCAnc0^ VP ..., (2.17) 
where repeated indices are summed up. 
As a consequence of Eqs. (2.14) and (2.15), D pġ has the same transformation 
properties as $: 
(Dub)! = (Dy) . (2.18) 


In fact 


(Duo) = (Ou +igV wb = (OLUP + Uduh c igUVA — (OLU) 
$ = U(Dyd). d 
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Thus Z[$, Du] is indeed invariant under gauge transformations. But, at this 
stage, the gauge fields y appear as external fields that do not propagate. In order to 


construct a gauge-invariant kinetic energy term for the gauge fields V^, we consider 
[D,, Dy,]ó = ig{9 V, — 0, Vu + ig[IVu, V] = igkwo , (2.20) 
which is equivalent to 


FA, = dp Vf —9,V? — gCAac VEVE. (2.21) 
From Eqs. (2.8), (2.18) and (2.20) it follows that the transformation properties of 
F 5 are those of a tensor of the adjoint representation 


F, = UFU! . (2.22) 


The complete Yang-Mills lagrangian, which is invariant under gauge transforma- 
tions, can be written in the form 


1 1 
Lym = -3TrF pF” + Lid, Dub] = -3 y FAF^" + Lib, Dud). 
A 
(2.23) 


Note that the kinetic energy term is an operator of dimension 4. Thus if £ is 
renormalizable, also Lym is renormalizable. In fact it is the most general gauge 
invariant and renormalizable lagrangian density. If we give up renormalizability then 
more gauge invariant higher dimension terms could be added. It is already clear at 
this stage that no mass term for gauge bosons of the form m? V, V" is allowed by 
gauge invariance. 

For an abelian theory, as for example QED, the gauge transformation reduces to 
U[0(x)] = exp[ie Q0 (x)], where Q is the charge generator. The associated gauge 
field (the photon), according to Eq. (2.15), transforms as 


V, = Vu — 8,00). (2.24) 


and the familiar gauge transformation by addition of a 4-gradient of a scalar function 
is recovered. The QED lagrangian density is given by: 
1 - 
usce ght Fay + M vaD- moy. (2.25) 
y 


Here D = D, y", where y" are the Dirac matrices and the covariant derivative is 
given in terms of the photon field A,, and the charge operator Q by: 


Dy = 8, 4 ieAuQ (2.26) 
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and 
Fuy = OpAv — 0 Ay (2.27) 


Note that in QED one usually takes the e~ to be the particle, so that Q = —1 and 
the covariant derivative is Dj, = 9,, — ie A, when acting on the electron field. In this 
case, the F,,, tensor is linear in the gauge field V, so that in the absence of matter 
fields the theory is free. On the other hand, in the non abelian case the F i tensor 
contains both linear and quadratic terms in v$, so that the theory is non-trivial even 
in the absence of matter fields. 


2.5 Application to QCD 


According to the formalism of the previous section, the statement that QCD is a 
renormalizable gauge theory based on the group SU (3) with colour triplet quark 
matter fields fixes the QCD lagrangian density to be 


8 nf 
1 I 
i= E AE + ) / 469 - moe; (2.28) 
—1 J= 


Here qj are the quark fields (of n f different flavours) with mass m; and D, is the 
covariant derivative: 


Dy, = Ou + lesgy; (2.29) 
es is the gauge coupling and later we will mostly use, in analogy with QED 
e 


a (2.30) 


as = 


Also, gj; = do, ie, where E A = 1,8, are the gluon fields and 1^ are the 
SU (3) group generators in the triplet representation of quarks (i.e. t4 are 3 x 3 
matrices acting on q); the generators obey the commutation relations [t4, t8] = 
iCapct© where C4zc¢ are the complete antisymmetric structure constants of SU (3) 
(the normalisation of CApc and of e; is specified by Tr[t4t?] = 84B 2); 


FA, = üug) — det — esCABCELRC (2.31) 


Chapter 4 is devoted to a detailed description of the QCD as the theory of 
strong interactions. The physical vertices in QCD include the gluon-quark-antiquark 
vertex, analogous to the QED photon-fermion-antifermion coupling, but also the 3- 
gluon and 4-gluon vertices, of order e, and e2 respectively, which have no analogue 
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in an abelian theory like QED. In QED the photon is coupled to all electrically 
charged particles but itself is neutral. In QCD the gluons are coloured hence self- 
coupled. This is reflected in the fact that in QED Fy» is linear in the gauge field, 
so that the term F p? in the lagrangian is a pure kinetic term, while in QCD F m is 
quadratic in the gauge field so that in F 5. we find cubic and quartic vertices beyond 
the kinetic term. Also instructive is to consider the case of scalar QED: 


1 
La -4FUF + (DL) (D^ 9) — m?(¢"o) (2.32) 
For Q = | we have: 


(Dud)! (D$) = (9,0) (0%) + ieAyl(O“6)'d — p Op] + e?ApA“O*o 
(2.33) 


We see that for a charged boson in QED, given that the kinetic term for bosons is 
quadratic in the derivative, there is a two-gauge vertex of order e?. Thus in QCD the 
3-gluon vertex is there because the gluon is coloured and the 4-gluon vertex because 
the gluon is a boson. 


2.6 Quantization of a Gauge Theory 


The lagrangian density Ly m in Eq. (2.23) fully describes the theory at the classical 
level. The formulation of the theory at the quantum level requires that a procedure of 
quantization, of regularization and, finally, of renormalization is also specified. To 
start with, the formulation of Feynman rules is not straightforward. A first problem, 
common to all gauge theories, including the abelian case of QED, can be realized 
by observing that the free equation of motion for V4, as obtained from Eqs. ((2.21), 
(2.23)), is given by 


[3?g,, — 8,9,]V^" = 0 (2.34) 


Normally the propagator of the gauge field should be determined by the inverse of 
the operator [07 guv — ðu 3v] which, however, has no inverse, being a projector over 
the transverse gauge vector states. This difficulty is removed by fixing a particular 
gauge. If one chooses a covariant gauge condition 9^ P = 0 then a gauge fixing 
term of the form 


1 
ALor = E > larvi? (2.35) 
A 
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has to be added to the lagrangian (1/4 acts as a lagrangian multiplier). The free 
equations of motion are now modified as follows: 


[87 guy — (1 — 1/3)8,3,]V^" = 0. (2.36) 


This operator now has an inverse whose Fourier transform is given by: 


DAP (q) = is suv t 0 Waa E jgan (2.37) 
q? TI tie 


which is the propagator in this class of gauges. The parameter A can take any value 
and it disappears from the final expression of any gauge invariant, physical quantity. 
Commonly used particular cases are A = 1 (Feynman gauge) and A = 0 (Landau 
gauge). 

While in an abelian theory the gauge fixing term is all that is needed for a 
correct quantization, in a non abelian theory the formulation of complete Feynman 
rules involves a further subtlety. This is formally taken into account by introducing 
a set of D fictitious ghost fields that must be included as internal lines in 
closed loops (Faddeev-Popov ghosts [12]). Given that gauge fields connected by 
a gauge transformation describe the same physics, clearly there are less physical 
degrees of freedom than gauge field components. Ghosts appear, in the form of 
a transformation Jacobian in the functional integral, in the process of elimination 
of the redundant variables associated with fields on the same gauge orbit [13]. 
The correct ghost contributions can be obtained from an additional term in the 
lagrangian density. For each choice of the gauge fixing term the ghost langrangian 
is obtained by eana the effect of an infinitesimal gauge transformation 
Vi C= = Ve — gCAnpcO ny = 8,0€ on the gauge fixing condition. For ət VE = =0 
ane obtains: 


avC = a4 VS — gCagc (0 VP) — 8290€ = — [9?54c + gCancV20")04 
(2.38) 


where the gauge condition 9^ ys — 0 has been taken into account in the last step. 
The ghost lagrangian is then given by: 


ALGhost = Ti [928Ac + gCABc VP 9" ]n^ (2.39) 


where A is the ghost field (one for each index A) which has to be treated as a scalar 
field except that a factor (—1) for each closed loop has to be included as for fermion 
fields. 

Starting from non covariant gauges one can construct ghost-free gauges. An 
example, also important in other respects, is provided by the set of “axial” gauges: 
n^ y = 0 where n,, is a fixed reference 4-vector (actually for n,, spacelike one 


has an axial gauge proper, for n? = 0 one speaks of a light-like gauge and for n u 
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timelike one has a Coulomb or temporal gauge). The gauge fixing term is of the 
form: 


1 
ALGF = o PN In" vp (2.40) 
A 


With a procedure that can be found in QED textbooks [14] the corresponding 
propagator, in Fourier space, is found to be: 


i Mud+"vdu — n^quqv 


AB 
q? 4 ie ICE (nq) (nq)? m 


Di dies 


In this case there are no ghost interactions because n^ V'A, obtained by a gauge 


transformation from n” V$, 
n” y = 0 has been taken into account. Thus the ghosts are decoupled and can 
be ignored. 

The introduction of a suitable regularization method that preserves gauge 
invariance is essential for the definition and the calculation of loop diagrams and for 
the renormalization programme of the theory. The method that is by now currently 
adopted is dimensional regularization [15] which consists in the formulation of the 
theory in n dimensions. All loop integrals have an analytic expression that is actually 
valid also for non integer values of n. Writing the results for n — 4 — e the loops 
are ultraviolet finite for e > 0 and the divergences reappear in the form of poles at 
€ — 0. 


contains no gauge fields, once the gauge condition 


2.7 Spontaneous Symmetry Breaking in Gauge Theories 


The gauge symmetry of the SM was difficult to discover because it is well hidden 
in nature. The only observed gauge boson that is massless is the photon. The gluons 
are presumed massless but cannot be directly observed because of confinement, and 
the W and Z weak bosons carry a heavy mass. Indeed a major difficulty in unifying 
the weak and electromagnetic interactions was the fact that e.m. interactions have 
infinite range (m, = 0), whilst the weak forces have a very short range, owing to 
mwy,z # 0. 

The solution of this problem is in the concept of spontaneous symmetry breaking, 
which was borrowed from statistical mechanics. 

Consider a ferromagnet at zero magnetic field in the Landau-Ginzburg approxi- 
mation. The free energy in terms of the temperature T and the magnetization M can 
be written as 


FM, T) > Fo(T) + 1/2 A2 (T)? + 1/4 A(T) (MP)? +... . (2.42) 
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a b 


Fig. 2.1 The potential V = 1/2 WM? + 1/4 A(M?)? for positive (a) or negative u? (b) (for 
simplicity, M is a 2-dimensional vector). The small sphere indicates a possible choice for the 
direction of M 


This is an expansion which is valid at small magnetization. The neglect of terms of 
higher order in M? is the analogue in this context of the renormalizability criterion. 
Also, A(T) > O is assumed for stability; F is invariant under rotations, i.e. all 
directions of M in space are equivalent. The minimum condition for F reads 


9F/9Mi 20, [u?(T) - A(T)M?|M = 0. (2.43) 


There are two cases, shown in Fig. 2.1. If u? > 0, then the only solution is M = 0, 
there is no magnetization, and the rotation symmetry is respected. In this case the 
lowest energy state (in a quantum theory the vacuum) is unique and invariant under 
rotations. If u? < 0, then another solution appears, which is 


IMo? = —u?/A . (2.44) 


In this case there is a continuous orbit of lowest energy states, all with the same 
value of |M| but different orientations. A particular direction chosen by the vector 
Mp leads to a breaking of the rotation symmetry. 

For a piece of iron we can imagine to bring it to high temperature and let it melt 
in an external magnetic field B. The presence of B is an explicit breaking of the 
rotational symmetry and it induces a non zero magnetization M along its direction. 
Now we lower the temperature while keeping B fixed. The critical temperature Terit 
(Curie temperature) is where uXT) changes sign: pole) = 0. For pure iron 
Terit is below the melting temperature. So at T = Terit iron is a solid. Below Terit we 
remove the magnetic field. In a solid the mobility of the magnetic domains is limited 
and a non vanishing Mo remains. The form of the free energy becomes rotationally 
invariant as in Eq. (2.43). But now the system allows a minimum energy state with 
non vanishing M in the direction where B was. As a consequence the symmetry is 
broken by this choice of one particular vacuum state out of a continuum of them. 

We now prove the Goldstone theorem [16]. It states that when spontaneous 
symmetry breaking takes place, there is always a zero-mass mode in the spectrum. 
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In a classical context this can be proven as follows. Consider a lagrangian 
1 2 
L= znel — V (¢). (2.45) 


The potential V ($) can be kept generic at this stage but, in the following, we will 
be mostly interested in a renormalizable potential of the form (with no more than 
quartic terms): 


1 1 
V($) = eH p + D 9*. (2.46) 


Here by p we mean a column vector with real components ¢; (121,2... N) (complex 
fields can always be decomposed into a pair of real fields), so that, for example, 
SY i $2. This particular potential is symmetric under a NxN orthogonal matrix 
rotation 9^ = Od, where O is a SO(N) transformation. For simplicity, we have 
omitted odd powers of $, which means that we assumed an extra discrete symmetry 
under @ <> —@. Note that, for positive u?, the mass term in the potential has the 
"wrong" sign: according to the previous discussion this is the condition for the 
existence of a non unique lowest energy state. More in general, we only assume 
here that the potential is symmetric under the infinitesimal transformations 


o> o —$4ó0, ó$j— i504 iio; 3 (2.47) 


where 50^ are infinitesimal parameters and tf are the matrices that represent the 
symmetry group on the representation of the fields $; (a sum over A is understood). 
The minimum condition on V that identifies the equilibrium position (or the vacuum 
state in quantum field theory language) is 


(AV /$;)($; = of) — 0. (2.48) 
The symmetry of V implies that 


8V = (ƏV /dG;)5b; = i30^(0V /9))t5; = 0. (2.49) 


By taking a second derivative at the minimum ¢; = $$, given by the previous 
equation, we obtain that, for each A: 


32V 


oV 
IPLI; (Qi = $1505 + — (¢i = dti =0. (2.50) 


86i 
The second term vanishes owing to the minimum condition, Eq. (2.48). We then find 


32V 


35,55, Se= (2.51) 
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The second derivatives M fe = (0°V/dgx 9G) (bi = $?) define the squared mass 
matrix. Thus the above equation in matrix notation can be written as 


M?^i^$9? =0. (2.52) 


In the case of no spontaneous symmetry breaking the ground state is unique, all 
symmetry transformations leave it invariant, so that, for all A, t4¢° = 0. On the 
contrary, if, for some values of A, the vectors (^99) are non-vanishing, i.e. there 
is some generator that shifts the ground state into some other state with the same 
energy (hence the vacuum is not unique), then each 146° ¥ 0 is an eigenstate 
of the squared mass matrix with zero eigenvalue. Therefore, a massless mode is 
associated with each broken generator. The charges of the massless modes (their 
quantum numbers in quantum language) differ from those of the vacuum (usually 
taken as all zero) by the values of the t^ charges: one says that the massless modes 
have the same quantum numbers of the broken generators, i.e. those that do not 
annihilate the vacuum. 

The previous proof of the Goldstone theorem has been given in the classical case. 
In the quantum case the classical potential corresponds to tree level approximation 
of the quantum potential. Higher order diagrams with loops introduce quantum 
corrections. The functional integral formulation of quantum field theory [13, 17] 
is the most appropriate framework to define and compute, in a loop expansion, 
the quantum potential which specifies, exactly as described above, the vacuum 
properties of the quantum theory. If the theory is weakly coupled, e.g. if A is small, 
the tree level expression for the potential is not too far from the truth, and the 
classical situation is a good approximation. We shall see that this is the situation 
that occurs in the electroweak theory if the Higgs is moderately light (see Chap. 3, 
Sect. 3.13.1). 

We note that for a quantum system with a finite number of degrees of freedom, for 
example one described by the Schródinger equation, there are no degenerate vacua: 
the vacuum is always unique. For example, in the one dimensional Schródinger 
problem with a potential: 


V(x) =u? mx Ax, (2.53) 


there are two degenerate minima at x = +xo = 4/(u?/A) which we denote by 
|--) and |—). But the potential is not diagonal in this basis: the off diagonal matrix 
elements: 


(+|V|—) = (-|V|+) ^ exp (—khd) = 6 (2.54) 


are different from zero due to the non vanishing amplitude for a tunnel effect 
between the two vacua, proportional to the exponential of the product of the distance 
d between the vacua and the height h of the barrier with k a constant (see Fig. 2.2). 
After diagonalization the eigenvectors are (|+) + I- 0/42 and (|+) — I-9/A/2, 
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Fig. 2.2 A Schrödinger 
potential V (x) analogous to 
the Higgs potential 


with different energies (the difference being proportional to 5). Suppose now that 
you have a sum of n equal terms in the potential, V = 7, V(x;). Then the 
transition amplitude would be proportional to 6” and would vanish for infinite n: the 
probability that all degrees of freedom together jump over the barrier vanishes. In 
this example there is a discrete number of minimum points. The case of a continuum 
of minima is obtained, always in the Schródinger context, if we take 


y =1/2 pr? + 1/40}, (2.55) 


with r — (x,y,z). Also in this case the ground state is unique: it is given by 
a state with total orbital angular momentum zero, an s-wave state, whose wave 
function only depends on |r|, independent of all angles. This is a superposition of 
all directions with the same weight, analogous to what happened in the discrete 
case. But again, if we replace a single vector r, with a vector field M(x), that is a 
different vector at each point in space, the amplitude to go from a minimum state in 
one direction to another in a different direction goes to zero in the limit of infinite 
volume. In simple words, the vectors at all points in space have a vanishing small 
amplitude to make a common rotation, all together at the same time. In the infinite 
volume limit all vacua along each direction have the same energy and spontaneous 
symmetry breaking can occur. 

The massless Goldstone bosons correspond to a long range force. Unless the 
massless particles are confined, as for the gluons in QCD, these long range forces 
would be easily detectable. Thus, in the construction of the EW theory we cannot 
accept massless physical scalar bosons. Fortunately, when spontaneous symmetry 
breaking takes place in a gauge theory, the massless Goldstone modes exist, but they 
are unphysical and disappear from the spectrum. Each of them becomes, in fact, the 
third helicity state of a gauge boson that takes mass. This is the Higgs mechanism 
(it should be called Englert-Brout-Higgs mechanism [18], because an equal merit 
should be credited to the simultaneous paper by Englert and Brout). Consider, for 
example, the simplest Higgs model described by the lagrangian 


l à 
L= —4 Fav +1@u + iep OA? uh do — SOO (2.56) 
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Note the ‘wrong’ sign in front of the mass term for the scalar field $, which 
is necessary for the spontaneous symmetry breaking to take place. The above 
lagrangian is invariant under the U (1) gauge symmetry 


Ay > Al, — Ay — 30E), o> d =explieQA(x)] 9. (2.57) 


For the U(1) charge Q we take Od = —4, like in QED, where the particle is e~. 
Let $9 = v Æ 0, with v real, be the ground state that minimizes the potential and 
induces the spontaneous symmetry breaking. In our case v is given by v? = u?/à. 
Making use of gauge invariance, we can do the change of variables 


(x) > [v h(x)/V2] expl—ig (x)/vv2] , 
Au (x) > Ay — 8uo(x)/evV2. (2.58) 


Then h = O is the position of the minimum, and the lagrangian becomes 
1 1 
f= nom Lev AL SEWA, + vae hvA? + L(h) . (2.59) 


The field ¢ (x) is the would-be Goldstone boson, as can be seen by considering only 
the ġ terms in the lagrangian, i.e. setting A,, = 0 in Eq. (2.56). In fact in this limit the 
kinetic term 0,70" 7 remains but with no ¢? mass term. Instead, in the gauge case 
of Eq. (2.56), after changing variables in the lagrangian, the field ¢(x) completely 
disappears (not even the kinetic term remains), whilst the mass term gu for 


A,, is now present: the gauge boson mass is M = V/2ev. The field h describes the 
massive Higgs particle. Leaving a constant term aside, the last term in Eq. (2.59) is 
given by: 


1 
Lih) = zü, hü"h — WE +... (2.60) 


where the dots stand for cubic and quartic terms in h. We see that the h mass term 
has the "right" sign, due to the combination of the quadratic terms in A that, after 
the shift, arise from the quadratic and quartic terms in $. The h mass is given by 
Z 5,2 
m, = 2°. 
The Higgs mechanism is realized in well-known physical situations. It was actu- 
ally discovered in condensed matter physics by Anderson [19]. For a superconductor 
in the Landau-Ginzburg approximation the free energy can be written as 


F = Fo + E + |(V — 2ieA)9l?/Am — ald|* + B|ol^ . (2.61) 


Here B is the magnetic field, ||? is the Cooper pair (e^ e^) density, 2e and 2m 
are the charge and mass of the Cooper pair. The ‘wrong’ sign of œ leads to $ 4 0 
at the minimum. This is precisely the non-relativistic analogue of the Higgs model 
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of the previous example. The Higgs mechanism implies the absence of propagation 
of massless phonons (states with dispersion relation œ = kv with constant v). 
Also the mass term for A is manifested by the exponential decrease of B inside the 
superconductor (Meissner effect). 


2.8 Quantization of Spontaneously Broken Gauge Theories: 
Rg Gauges 


We have discussed in Sect. 2.6 the problems arising in the quantization of a 
gauge theory and in the formulation of the correct Feynman rules (gauge fixing 
terms, ghosts etc.). Here we give a concise account of the corresponding results 
for spontaneously broken gauge theories. In particular we describe the Rz gauge 
formalism [13, 17, 20]: in this formalism the interplay of transverse and longitudinal 
gauge boson degrees of freedom is made explicit and their combination leads to the 
cancellation from physical quantities of the gauge parameter $. We work out in 
detail an abelian example that later will be easy to generalize to the non abelian 
case. 

We restart from the abelian model of Eq. (2.56) (with Q — — 1). In the treatment 
presented there the would be Goldstone boson ¢(x) was completely eliminated 
from the lagrangian by a non linear field transformation formally identical to a 
gauge transformation corresponding to the U (1) symmetry of the lagrangian. In that 
description, in the new variables we eventually obtain a theory with only physical 
fields: a massive gauge boson A,, with mass M — A/2ev and a Higgs particle h with 
mass mj = 4/2. This is called a “unitary” gauge, because only physical fields 
appear. But for a massive gauge boson the propagator: 


yv — kk, / M? 

iD, (k) = HEIL , (2.62) 
has a bad ultraviolet behaviour due to the second term in the numerator. This 
choice does not prove to be the most convenient for a discussion of the ultraviolet 
behaviour of the theory. Alternatively one can go to an alternative formulation where 
the would be Goldstone boson remains in the lagrangian but the complication of 
keeping spurious degrees of freedom is compensated by having all propagators with 
good ultraviolet behaviour (“renormalizable” gauges). To this end we replace the 
non linear transformation for $ in Eq. (2.58) with its linear equivalent (after all 
perturbation theory deals with the small oscillations around the minimum): 


p(x) [v + h(x)/V2] expl—it(x)/vV2] ~ [v + hG3/N2 it]. 
(2.63) 


Here we have only applied a shift by the amount v and separated the real and 
imaginary components of the resulting field with vanishing vacuum expectation 
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value. If we leave A, as it is and simply replace the linearized expression for @, 
we obtain the following quadratic terms (those important for propagators): 


quad = —4 D FA F^ + IMA, AM + 


1 
+ 5 Out)? + MA, a" + 5 uh)” ah (2.64) 


The mixing term between A,, and 0,,¢ does not allow to directly write diagonal mass 
matrices. But this mixing term can be eliminated by an appropriate modification of 
the covariant gauge fixing term given in Eq. (2.35) for the unbroken theory. We now 
take: 


ALGF = E —EMzty. (2.65) 


By adding A Zgr to the quadratic terms in Eq. (2.64) the mixing term cancels (apart 
from a total derivative that can be omitted) and we have: 


quad = —7 23 FF 4 IMA, A" — se OA + 
£504 -ËM + EOM? — Py? (2.66) 


We see that the ¢ field appears with a mass VEM and its propagator is: 


i 


i De = ————————. 2.67 
UT = REM? ie Sem 
The propagators of the Higgs field h and of gauge field A,, are: 
i D i (2.68) 
iD, = ——————, : 
h= Lu + ie 
; =i 
iD,,(k) = BoM aie 8H -0-677 DD (2.69) 


As anticipated, all propagators have a good behaviour at large k*. This class of 
gauges are called “Re gauges" [20]. Note that for § = 1 we have a sort of 
generalization of the Feynman gauge with a Goldstone of mass M and a gauge 
propagator: 


—i guv 


E E e 


(2.70) 
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Also for E — oo the unitary gauge description is recovered in that the Goldstone 
propagator vanishes and the gauge propagator reproduces that of the unitary gauge 
in Eq. (2.62). All & dependence, including the unphysical singularities of the ¢ and 
A „ propagators at k? = EM”, present in individual Feynman diagrams, must cancel 
in the sum of all contributions to any physical quantity. 

An additional complication is that a Faddeev-Popov ghost is also present in Rz 
gauges (while it is absent in an unbroken abelian gauge theory). In fact under an 
infinitesimal gauge transformation with parameter 0 (x): 


Au > Ay — 940 
o > (0 — ieO)[v + h(x)/V2 — it(x)/V2], (2.71) 

so that: 
8A&,-— —89,0, óh— —et0, 8t —e042(v - h/A2) . (2.72) 


The gauge fixing condition 0, A^ — €M¢ = 0 undergoes the variation: 


9,A" — EME > ð A! — EME — [2 - EM? (1 - h/v/2)]0 , (2.73) 


where we used M — 4/2ev. From this, recalling the discussion in Sect. 2.6, we 
see that the ghost is not coupled to the gauge boson (as usual for an abelian gauge 
theory) but has a coupling to the Higgs field h. The ghost lagrangian is: 


ALGhost = ÑL? + £M? (1 + h/v/2)]n . Q.74) 
The ghost mass is seen to be mgn = ./EM and its propagator is: 


i 


n 2.75 
k2 — EM? +ie (aara 


iDgn = 


The detailed Feynman rules follow from all the basic vertices involving the gauge 
boson, the Higgs, the would be Goldstone boson and the ghost and can be easily 
derived, with some algebra, from the total lagrangian including the gauge fixing 
and ghost additions. The generalization to the non abelian case is in principle 
straightforward, with some formal complications involving the projectors over the 
space of the would be Goldstone bosons and over the orthogonal space of the Higgs 
particles. But for each gauge boson that takes mass M, we still have a corresponding 
would be Goldstone boson and a ghost with mass /$ M,. The Feynman diagrams, 
both for the abelian and the non abelian case, are listed explicitly, for example, in 
the Cheng and Li textbook in ref.[17]. 

We conclude that the renormalizability of non abelian gauge theories, also in 
presence of spontaneous symmetry breaking, was proven in the fundamental works 
of t' Hooft and Veltman [21] and discussed in detail in [22]. 
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Chapter 3 A 
The Standard Model of Electroweak Geek for 
Interactions 


Guido Altarelli and Stefano Forte 


3.1 Introduction 


In this chapter,! we summarize the structure of the standard EW theory [1] 
and specify the couplings of the intermediate vector bosons W~, Z and of the 
Higgs particle with the fermions and among themselves, as dictated by the gauge 
symmetry plus the observed matter content and the requirement of renormalizability. 
We discuss the realization of spontaneous symmetry breaking and of the Higgs 
mechanism [2]. We then review the phenomenological implications of the EW 
theory for collider physics (that is we leave aside the classic low energy processes 
that are well described by the “old” weak interaction theory (see, for example, [3])). 
Moreover, a detailed description of experiments for precision tests of the EW theory 
is presented in Chap. 6. 

For this discussion we split the lagrangian into two parts by separating the terms 
with the Higgs field: 


L= L gauge F LHiggs . (3.1) 


Both terms are written down as prescribed by the SU (2) & U(1) gauge symmetry 
and renormalizability, but the Higgs vacuum expectation value (VEV) induces the 
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spontaneous symmetry breaking responsible for the non vanishing vector boson and 
fermion masses. 


3.2 The Gauge Sector 


We start by specifying Leauge, which involves only gauge bosons and fermions, 
according to the general formalism of gauge theories discussed in Chap. 2: 


3 
1 1 =, - 0. 
Leauge = “4 b» pr i qb B" + VLiy" Duy, + Vniy" Dur . 
A=1 
(3.2) 


This is the Yang-Mills lagrangian for the gauge group SU (2) & U (1) with fermion 
matter fields. Here 


B,,—9,B, —8,B, and Fj,—9,W; —9,W; — geanc WEWE — (33) 


are the gauge antisymmetric tensors constructed out of the gauge field B, associated 
with U(1), and wi corresponding to the three SU (2) generators; eAgc are the 
group structure constants (see Eqs. (3.8, 3.9)) which, for SU (2), coincide with the 
totally antisymmetric Levi-Civita tensor (recall the familiar angular momentum 
commutators). The normalization of the SU(2) gauge coupling g is therefore 
specified by Eq. (3.3). 

The fermion fields are described through their left-hand and right-hand compo- 
nents: 


Wir — [cr y3/21y, ver = VIO + ys)/21, (3.4) 


with ys and other Dirac matrices defined as in the book by Bjorken-Drell [4]. In 
s 2. T un . . 
particular, y£ = 1, ys = ys. Note that, as given in Eq. (3.4), 


V = yi r = V? EO — ys)/21yo = wold — ys)/21yo = VIC + y5)/2]. 


The matrices P+ = (1 + ys)/2 are projectors. They satisfy the relations P+ P+ = 
Ps, Pt P= = 0, P} + P- = 1. 

The sixteen linearly independent Dirac matrices can be divided into ys-even and 
ys-odd according to whether they commute or anticommute with ys. For the ys- 
even, we have 


VIgV = YL EYR + Yr EYL — (TES Lvs, ow), (3.5) 
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whilst for the ys-odd, 


VTov = YL oyL +Yr oyr = (To = yu Yays) - (3.6) 


The standard EW theory is a chiral theory, in the sense that Yz and yr behave 
differently under the gauge group (so that parity and charge conjugation non 
conservation are made possible in principle). Thus, mass terms for fermions (of 
the form vp Vg + h.c.) are forbidden in the symmetric limit. In particular, in the 
Minimal Standard Model (MSM: i.e. the model that only includes all observed 
particles plus a single Higgs doublet), all v; are SU (2) doublets while all v 
are singlets. But for the moment, by v; & we mean column vectors, including 
all fermion types in the theory that span generic reducible representations of 
SU(2) & U(1). 

In the absence of mass terms, there are only vector and axial vector interactions 
in the lagrangian and those have the property of not mixing v; and wr. Fermion 
masses will be introduced, together with W~ and Z masses, by the mechanism of 
symmetry breaking. The covariant derivatives D, y, g are explicitly given by 


3 


: sgl 
D,VL,R = l^ tig > th We + T T WLR > (3.7) 
A=1 


where E p and 1/2Y; g are the SU (2) and U(1) generators, respectively, in the 
reducible representations wz g. The commutation relations of the SU (2) generators 
are given by 


A ,B ; C A ,B s C 
[tp tp] =i €aBcty and ltr, tr] =i€aBctp . (3.8) 


We use the normalization (3.8) [in the fundamental representation of SU (2)]. The 
electric charge generator Q (in units of e, the positron charge) is given by 


Q= +1/2YL =t} 4 12Yg. (3.9) 


Note that the normalization of the U (1) gauge coupling g' in (3.7) is now specified 
as a consequence of (3.9). Note that th Wr = 0, given that, for all known quark and 
leptons, Vg is a singlet. But in the following, we keep th Vg for generality, in case 
1 day a non singlet right-handed fermion is discovered. 


3.3 Couplings of Gauge Bosons to Fermions 


All fermion couplings of the gauge bosons can be derived directly from Eqs. (3.2) 
and (3.7). The charged W,, fields are described by wo while the photon A,, and 


weak neutral gauge boson Z, are obtained from combinations of Wi and B,,. The 
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charged-current (CC) couplings are the simplest. One starts from the wi? terms in 
Eqs. (3.2) and (3.7) which can be written as: 


ga WI PW) = g [e + i?) /V2\(W) — W2)/2] + h.c.] 


= g [trt w/2) e ne. . (3.10) 


where t+ = t! tit? and W* = (W! £iW?)/4/2. By applying this generic relation 
to L and R fermions separately, we obtain the vertex 


Viyw = £V Yu [eti 5a — ys)/2 + (r$ /N 2) + ¥5)/2| VW, - h.c. 
(3.11) 
Given that £g = 0 for all fermions in the SM, the charged current is pure V — A. 


In the neutral-current (NC) sector, the photon A,, and the mediator Z, of the weak 
NC are orthogonal and normalized linear combinations of B,, and Wi: 


Ay = cosOw By + sin 0w W? A 
Z, = — sin Ow B, + cosówW; . (3.12) 
and conversely: 


Ww; = sin 0w A, + cos ôw Zp , 


By, = cos0w Ap — sin ôw Zp. (3.13) 


Equations (3.12) define the weak mixing angle Ow. We can rewrite the w3 and B, 
terms in Eqs. (3.2) and (3.7) as follows: 


g^ Wi --gY/2B, = [gt sinów + g'(Q — ?)cos6w]A, + 
+ [gt? cosów — g'(Q — t^) sinOw1Z, , (3.14) 
where Eq. (3.9) for the charge matrix Q was also used. The photon is characterized 
by equal couplings to left and right fermions with a strength equal to the electric 
charge. Thus we immediately obtain 
g sindw = g’ cosOw =e, (3.15) 


or equivalently, 


tg ow = a'/g 3.16) 
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Once 0w has been fixed by the photon couplings, it is a simple matter of algebra to 
derive the Z couplings, with the result 


es 8 n 3 3 Lg 
Viuz = 2cosoy eA — ys) + tR(1 + ys) — 2Q sinf 0w]v Z“ , (3.17) 


where Vow z isa notation for the vertex. Once again, recall that in the MSM, t$ =0 
andi? = +1/2. 

In order to derive the effective four-fermion interactions that are equivalent, at 
low energies, to the CC and NC couplings given in Eqs. (3.11) and (3.17), we 
anticipate that large masses, as experimentally observed, are provided for W~ and 
Z by Luiggs. For left-left CC couplings, when the momentum transfer squared can 
be neglected, with respect to may, in the propagator of Born diagrams with single 
W exchange (see, for example, the diagram for u decay in Fig. 3.1, from Eq. (3.11) 
we can write 


2 
a - [V yuQ. — vsti wily" (1 — yat V] - (3.18) 
W 


By specializing further in the case of doublet fields such as v; — e~ or vy — U7, 
we obtain the tree-level relation of g with the Fermi coupling constant G r precisely 
measured from jz decay (see Chap. 2, Eqs. (2), (3)): 


Gr/V2 = g?/8m3, . (3.19) 
By recalling that g sin 0w = e, we can also cast this relation in the form 
mw = MBom/ Sin Ow , (3.20) 
with 
UBom = Gra /A/2G p)? ~ 37.2802 GeV , (3.21) 
where o is the fine-structure constant of QED (o = e? /4n = 1/137.036). 


In the same way, for neutral currents we obtain in Born approximation from 
Eq. (3.17) the effective four-fermion interaction given by 


LY ~ V2 Groot yu - Try" E. dv, (3.22) 
Fig. 3.1 The Born diagram Vu Ve 


for u decay ^ É 
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where 
[..]5(20 — vs) - #3201 + ys) — 20 sin’ Ow (3.23) 
and 
2 
m 
00 = —— LL . (3.24) 
mz cost Ow 


All couplings given in this section are obtained at tree level and are modified in 
higher orders of perturbation theory. In particular, the relations between my and 
sin Ow (Eqs. (3.20) and (3.21)) and the observed values of o (o = po at tree level) 
in different NC processes, are altered by computable EW radiative corrections, as 
discussed in Sect. (3.11). 

The partial width F(W — f f^) is given in Born approximation by the simplest 
diagram in Fig. 3.2 and one readily obtains from Eq. (3.11) with tg = 0, in the limit 
of neglecting the fermion masses and summing over all possible f' for a given f: 


- Grm? amw 
r(W > ff) =N W = Nc———,, 3.25 
mem a oy vee) 


where Nc = 3 or 1 is the number of colours for quarks or leptons, respectively, and 
the relations Eqs. (3.15, 3.19) have been used. Here and in the following expressions 
for the Z widths the one loop QCD corrections for the quark channels can be 
absorbed in a redefinition of Nc: Nc — 3[1 + o5 mz)/zx + ...]. Note that the 
widths are particularly large because the rate already occurs at order g? or Gr. 
The experimental values of the W total width and the leptonic branching ratio (the 
average of e, u and t modes) are [5, 8] (see Chap. 6): 


Iw = 2.147 + 0.060 GeV, B(W — lvi) = 10.80 + 0.09. (3.26) 


The branching ratio B is in very good agreement with the simple approximate 
formula, derived from Eq. (3.25): 


1 
BW — lvi) ^ —————_ ^ 10.896. (3.27) 


2:3 (1 Fo (m2)/z) +3 


Fig. 3.2 Diagrams for (a) the f f 


W and (b) the Z widths in n 1 
Born approximation 


3 The Standard Model of Electroweak Interactions 41 


The denominator corresponds to the sum of the final states d'u, s’C, e ve, 7 Vu, 
t V; (for the definition of d' and s’ see Eq. (3.66)). 
For tr = 0 the Z coupling to fermions in Eq. (3.17) can be cast into the form: 


Viz = osa rne -= elysly y Z" ; (3.28) 

with: 
ej -n! .sy/8) =1-4lQplsin? ow. (3.29) 
and d = +1/2 for up-type or down-type fermions. In terms of g4,y given in 


Eqs. (3.29) (the widths are proportional to (83 + 82)). the partial width r (Z > f f) 
in Born approximation (see the diagram in Fig. 3.2), for negligible fermion masses, 
is given by: 


T(Z > Ff) = Ne— —[ + (1— 41/1 sin? 6w?] 
; 12 sin? 20y i 
Grm? r 
= Nemz Al + (1 — 4/0 f| sin? 6w)?]. (3.30) 


where pp = n / m?, cos? Ow is given in Eq. (3.55). The experimental values of the 
Z total width and of the partial rates into charged leptons (average of e, u and c), 
into hadrons and into invisible channels are [5, 8] (see Chap. 6): 
Tz = 2.4952 + 0.0023 GeV, 
T+- = 83.985 + 0.086 MeV, 
I, = 1744.4+2.0 MeV, 
Tiny = 499.0 + 1.5 MeV. (3.31) 


The measured value of the Z invisible width, taking radiative corrections into 
account, leads to the determination of the number of light active neutrinos (see 
Chap. 6): 


Ny = 2.9841 + 0.0083, (3.32) 


well compatible with the three known neutrinos ve, v,, and vz; hence there exist only 
the three known sequential generations of fermions (with light neutrinos), a result 
with important consequences also in astrophysics and cosmology. 

At the Z peak, besides total cross sections, various types of asymmetries have 
been measured. The results of all asymmetry measurements are quoted in terms of 
the asymmetry parameter A y, defined in terms of the effective coupling constants, 
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el and el. as: 


Pg Pur 
3 
pee Lg Ef E QAM. 33) 


d 2 2 : ? 
sy teh 1+ (87/81) 


The measurements are: the forward-backward asymmetry (At p = (3/4)ACA p), the 
tau polarization (A, ) and its forward backward asymmetry (Ae) measured at LEP, as 
well as the left-right and left-right forward-backward asymmetry measured at SLC 
(Ae and Af, respectively). Hence the set of partial width and asymmetry results 
allows the extraction of the effective coupling constants: widths measure (gi, + 82) 
and asymmetries measure gy /ga. 

The top quark is heavy enough that it can decay into a real bW pair, which is by 
far its dominant decay channel. The next mode, t — s W, is suppressed in rate by a 
factor |V;, ls ~ 1.71072, see Eqs. (3.71-3.73). The associated width, neglecting mp 
effects but including 1-loop QCD corrections in the limit mw = 0, is given by (we 
have omitted a factor Vil? that we set equal to 1): 


Tit > bW?) = 


Grm; Dy o my 5 
1— ——»*(ü042——)I- ————(—— - =) +... ]. 
an JB 225 "eu ( ) ] 


The top quark lifetime is so short, about 0.5: 107^ s, that it decays before hadroniz- 
ing or forming toponium bound states. 


3.4 Gauge Boson Self-interactions 


The gauge boson self-interactions can be derived from the F,,, term in Leauge, by 
using Eq. (3.12) and W* = (W! + iW?)/4/2. 

Defining the three-gauge-boson vertex as in Fig. 3.3 (with all incoming lines), we 
obtain (V = y, Z) 


Vw-w*y = igw-w-*vlguv( — Mat Sul’ — pv t (4 — r)a], (3.35) 
with 
8£w-w-y, —8 sinÜów =e and gw-w+z = 8 cosOw. (3.36) 


Note that the photon coupling to the W is fixed by the electric charge, as imposed 
by QED gauge invariance. The ZWW coupling is larger by a tan Oy factor. This 
form of the triple gauge vertex is very special: in general, there could be departures 
from the above SM expression, even restricting us to Lorentz invariant, em gauge 
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Fig. 3.3 The three- and four-gauge boson vertices. The cubic coupling is of order g, while the 
quartic one is of order g? 
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Fig. 3.4 The three- and four-gauge boson vertices. The cubic coupling is of order g, while the 
quartic one is of order g? 


symmetric and C and P conserving couplings. In fact some small corrections are 
already induced by the radiative corrections. But, in principle, more important could 
be the modifications induced by some new physics effect. The experimental testing 
of the triple gauge vertices has been done mainly at LEP2 and at the Tevatron. At 
LEP2 the crosssection and angular distributions for the process e*e^ —^ W*W- 
have been studied (see Chap. 6). 

In Born approximation the Feynman diagrams for the LEP2 process are shown 
in Fig. 3.4 [6]. Besides neutrino exchange which only involves the well established 
charged current vertex, the triple weak gauge vertices Vy- w- y appear in the y and 
Z exchange diagrams. The Higgs exchange is negligible because the electron mass is 
very small. The analytic cross section formula in Born approximation can be found, 
for example, in Ref. [5]. The experimental data are compared with the SM prediction 
in Chap. 6 [7]. The agreement is very good. Note that the sum of all three exchange 
amplitudes has a better high energy behaviour. This is due to cancellations among 
the amplitudes implied by gauge invariance, connected to the fact that the theory is 
renormalizable (the crosssection can be seen as a contribution to the imaginary part 
of the ete~ — e*e- amplitude). 

The quartic gauge coupling is proportional to g?eAgcW^WCe4pgWP WE. 
Thus in the term with A — 3 we have four charged W's. For A — 1 or two 
we have two charged W's and 2 W?'s, each Ws being a combination of y and Z 
according to Eq. (3.13). With a little algebra the quartic vertex can be cast in the 
form: 


Vwwvv — igwwvvl2guv8rp — Sur8vp — Sup 8val , (3.37) 
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where, u and v refer to Wt W* in the 4W vertex and to VV in the WWV V case 
and: 


gwwww =g, g£WWyy = -e, gwwyz = —egcosO0w, gwwzz = —g? cos” Ow . 
(3.38) 


In order to obtain these result for the vertex the reader must duly take into account 
the factor of —1/4 in front of F A in the lagrangian and the statistical factors 
which are equal to two for each pair of identical particles (like W^ W* or yy, for 
example). The quartic coupling, being quadratic in g, hence small, could not be 
directly tested so far. 


3.5 The Higgs Sector 


We now turn to the Higgs sector of the EW lagrangian. The Higgs lagrangian is 
specified by the gauge principle and the requirement of renormalizability to be 


Luiges = (DLO) (DHG) — V (0t) — VT Vno — aV Vio, (3.39) 


where ¢ is a column vector including all Higgs fields; it transforms as a reducible 
representation of the gauge group. The quantities I (which include all coupling 
constants) are matrices that make the Yukawa couplings invariant under the Lorentz 
and gauge groups. Without loss of generality, here and in the following, we take T 
to be ys-free. The potential V ($9), symmetric under SU (2) & U (1), contains, at 
most, quartic terms in $ so that the theory is renormalizable: 


1 i 
Voto) = -u° p'o + PCR (3.40) 


As discussed in Chap.2, spontaneous symmetry breaking is induced if the 
minimum of V, which is the classical analogue of the quantum mechanical vacuum 
state (both are the states of minimum energy), is obtained for non-vanishing $ 
values. Precisely, we denote the vacuum expectation value (VEV) of 4$, i.e. the 
position of the minimum, by v (which is a doublet): 


(010 ()]0) = v = (2) #0. (3.41) 


The reader should be careful that the same symbol is used for the doublet and the 
only non zero component of the same doublet. The fermion mass matrix is obtained 
from the Yukawa couplings by replacing $ (x) by v: 


M = yi Mig + Vg MI yr, (3.42) 
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with 
M=TI.-v. (3.43) 


In the MSM, where all left fermions yz are doublets and all right fermions Wr are 
singlets, only Higgs doublets can contribute to fermion masses. There are enough 
free couplings in I, so that one single complex Higgs doublet is indeed sufficient to 
generate the most general fermion mass matrix. It is important to observe that by a 
suitable change of basis we can always make the matrix M Hermitian and diagonal. 
In fact, we can make separate unitary transformations on vy, and we according to 


Vp — UL. Vg —Wwyn (3.44) 
and consequently 
M — M' 2 U'MW . (3.45) 


This transformation does not alter the structure of the fermion couplings in Csymm 
(because both the kinetic terms and the couplings to gauge bosons do not mix L 
and R spinors) except that it leads to the phenomenon of mixing, as we shall see in 
Sect. (3.6). 

If only one Higgs doublet is present, the change of basis that makes M diagonal 
will at the same time diagonalize the fermion-Higgs Yukawa couplings. Thus, in 
this case, no flavour-changing neutral Higgs vertices are present. This is not true, 
in general, when there are several Higgs doublets. But one Higgs doublet for each 
electric charge sector i.e. one doublet coupled only to u-type quarks, one doublet to 
d-type quarks, one doublet to charged leptons (and possibly one for neutrino Dirac 
masses) would also be all right, because the mass matrices of fermions with different 
charges are diagonalized separately. For several Higgs doublets in a given charge 
sector it is also possible to generate CP violation by complex phases in the Higgs 
couplings. In the presence of six quark flavours, this CP-violation mechanism is not 
necessary. In fact, at the moment, the simplest model with only one Higgs doublet 
seems adequate for describing all observed phenomena. 

We now consider the gauge-boson masses and their couplings to the Higgs. These 
effects are induced by the (Duo) (Dte) term in Lyiggs (Eq. (3.39)), where 


3 
Dud = E tig Awa + era, $. (3.46) 
A=1 


Here t^ and Y/2 are the SU (2) & U(1) generators in the reducible representation 
spanned by $. Not only doublets but all non-singlet Higgs representations can 
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contribute to gauge-boson masses. The condition that the photon remains massless 
is equivalent to the condition that the vacuum is electrically neutral: 


Q|v) = (@ Y) =0. (3.47) 


We now explicitlly consider the case of a single Higgs doublet: 


+ 
HE) O as 


The charged W mass is given by the quadratic terms in the W field arising from 
LHhiggs, When $ (x) is replaced by v in Eq. (3.41). By recalling Eq. (3.10), we obtain 


mi, Wt W-" = gtv VD WEW , (3.49) 


whilst for the Z mass we get [recalling Eqs. (3.12—3.14)] 
1 
5m ZZ" = |[g cos wt? — g' sin 0w (Y /D Jv? Z, Z^ , (3.50) 


where the factor of 1/2 on the left-hand side is the correct normalization for the 
definition of the mass of a neutral field. By using Eq. (3.47), relating the action of 7? 
and Y /2 on the vacuum v, and Eqs. (3.16), we obtain 


sm = (g cos Ow + g'sinów)^|?^v|? = (g?/ cos? 0w)|?^ v? . (3.51) 
For a Higgs doublet, as in Eq. (3.48), we have 

ttu? = v2, Pw? = 1/402, (3.52) 
so that 
m; = 1/2g7v?, m3 = 1/2g7v"/ cos” Ow . (3.53) 


Note that by using Eq. (3.19) we obtain 


v = 27/^g,7 = 174.1 GeV. (3.54) 
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It is also evident that for Higgs doublets 


2 


my 
p= =z = 1. (3.55) 
mz cos* Ow 


This relation is typical of one or more Higgs doublets and would be spoiled by the 
existence of Higgs triplets etc. In general, 


Yu) — GY t) 
po = St (3.56) 
2,207» 
for several Higgs bosons with VEVs v;, weak isospin t;, and z-component t? ] 
These results are valid at the tree level and are modified by calculable EW radiative 
corrections, as discussed in Sect. (3.7). 
The measured values of the W and Z masses are [5, 8] (see Chap. 6): 


mw = 80.398 + 0.025 GeV, mz = 91.1875 + 0.0021 GeV. (3.57) 


In the minimal version of the SM only one Higgs doublet is present. Then 
the fermion-Higgs couplings are in proportion to the fermion masses. In fact, 
from the Yukawa couplings Soff FLFR + h.c.), the mass mf is obtained by 
replacing $ by v, so that mp = Saf pr: In the minimal SM three out of the four 
Hermitian fields are removed from the physical spectrum by the Higgs mechanism 
and become the longitudinal modes of Wt,W-, and Z. The fourth neutral 
Higgs is physical and should be found. If more doublets are present, two more 
charged and two more neutral Higgs scalars should be around for each additional 
doublet. 

The couplings of the physical Higgs H can be simply obtained from Lyiggs, by 
the replacement (the remaining three hermitian fields correspond to the would be 
Goldstone bosons that become the longitudinal modes of W~ and Z): 


C2) ~ Cada 
$c = (Soon 7 bor r2) ' eee) 


[so that (Duo) (Dto) = 1/2(8,Hy 4T ...], with the results 


2 
£[H, W, Z] = g IW tH + Swit WHA? + 
LIIS nen 8 Zh. (39 
P 2//2cos0y ^ 8cos?üy ^ : ` 


Note that the trilinear couplings are nominally of order g?, but the adimensional 
coupling constant is actually of order g if we express the couplings in terms of the 
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masses according to Eqs. (3.53): 
g 
LIH, W, Z] = gmwWiW "H+ WWW HH + 


Og 
8cos? 0w 


Smz 


LL s Z ZH". 3.60 
* 2 cos? Ow ^ (2200) 


Z,Z" H + 
Thus the trilinear couplings of the Higgs to the gauge bosons are also proportional 
to the masses. The quadrilinear couplings are genuinely of order g?. Recall that to 
go from the lagrangian to the Feynman rules for the vertices the statistical factors 
must be taken into account: for example, the Feynman rule for the ZZH H vertex 
is gug /2 cos? Oy. 

The generic coupling of H to a fermion of type f is given by (after diagonaliza- 
tion): 


LIH, Jj, y] = SH, (3.61) 
with 
Ce Cog ie. (3.62) 


42. vv 


The Higgs self couplings are obtained from the potential in Eq. (3.40) by the 
replacement in Eq. (3.58). Given that, from the minimum condition: 


2 

u 
v=,/— 3.63 
E (3.63) 

one obtains: 

H u H u? u u 
V = -p w+ —Y 4+ t+ St = - 4 "+ H+ 
EOS ae n ee pe m 


(3.64) 


The constant term can be omitted in our context. We see that the Higgs mass is 
positive (compare with Eq. (3.40)) and is given by: 


my 224? = 21? (3.65) 


We see that for VA ~ o(1) the Higgs mass should be of the order of the weak scale. 

The difficulty of the Higgs search is due to the fact that it is heavy and coupled 
in proportion to mass: it is a heavy particle that must be radiated by another heavy 
particle. So a lot of phase space and luminosity is needed. At LEP2 the main process 
for Higgs production was the Higgs-strahlung process ete — ZH shown in 
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Fig. 3.5 Higgs production 
diagrams in Born 


approximation: (a) The EF Z 
Higgs-strahlung process a Nd 
e*e- — ZH,(b) the WW je AN 

ES i *« 


fusion process e* e^ > Hv» 


Fig.3.5 [9]. The alternative process e*e^ — Hvv, via WW fusion, also shown 
in Fig. 3.5 [10], has a smaller crosssection at LEP2 energies but would become 
important, even dominant at higher energy ete~ colliders, like the ILC or CLIC 
(the corresponding ZZ fusion process has a much smaller crosssection). The analytic 
formulae for the crosssections of both processes can be found, for example, in [11]. 
The direct experimental limit on my from LEP2 is my = 114 GeV at 95% c.l. (see 
Chap. 6). 


3.6 The CKM Matrix 


Weak charged currents are the only tree level interactions in the SM that change 
flavour: for example, by emission of a W an up-type quark is turned into a down- 
type quark, or a v; neutrino is turned into a /~ charged lepton (all fermions are 
letf-handed). If we start from an up quark that is a mass eigenstate, emission of a 
W turns it into a down-type quark state d’ (the weak isospin partner of u) that in 
general is not a mass eigenstate. The mass eigenstates and the weak eigenstates do 
not coincide and a unitary transformation connects the two sets: 


d' d 
D'=|s|=V|s|=VD (3.66) 
b' b 


V is the Cabibbo-Kobayashi-Maskawa (CKM) matrix [12] (and similarly we can 
denote by U the column vector of the three up quark mass eigenstates). Thus in 
terms of mass eigenstates the charged weak current of quarks is of the form: 

Ji x Uyy(1 — ysr* VD (3.67) 


where 


V=UlUg (3.68) 
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Here U,, and Ug are the unitary matrices that operate on left-handed doublets in 
the diagonalization of the u and d quarks, respectively (see Eq. (3.44)). Since V 
is unitary (i.e. VV! = V'V = 1) and commutes with T?, T; and Q (because 
all d-type quarks have the same isospin and charge), the neutral current couplings 
are diagonal both in the primed and unprimed basis (if the down-type quark terms 
in the Z current are written in terms of weak isospin eigenvectors as D'T D', 
then by changing basis we get DV'TVD and V and T commute because, as 
seen from Eq. (3.23), T is made of Dirac matrices and of 73 and Q generator 
matrices). It follows that D'T D’ = DID. This is the GIM mechanism [13] that 
ensures natural flavour conservation of the neutral current couplings at the tree 
level. 

For N generations of quarks, V is a NxN unitary matrix that depends on N? 
real numbers (N? complex entries with N? unitarity constraints). However, the 2N 
phases of up- and down-type quarks are not observable. Note that an overall phase 
drops away from the expression of the current in Eq. (3.67), so that only 2N — 1 
phases can affect V. In total, V depends on N? — 2N + 1 = (N — 1)? real physical 
parameters. A similar counting gives N(N — 1)/2 as the number of independent 
parameters in an orthogonal NxN matrix. This implies that in V we have N(N — 
1)/2 mixing angles and (N — 1)? — N(N — 1)/2 = (N — 1)(N — 2)/2 phases: for 
N = 2 one mixing angle (the Cabibbo angle 0c) and no phases, for N = 3 three 
angles (012, 013 and 623) and one phase ¢ etc. 

Given the experimental near diagonal structure of V a convenient parametrisation 
is the one proposed by Maiani [14]. It can be cast in the form of a product of 
three independent 2 x 2 block matrices (s;; and cj; are shorthands for sin6;; and 
cos 6;;): 


1 0 0 C13 0 spel? C12 $5812 0 
V = [0 c3 523 0 1 0 —$12 €120] . (3.69) 
0 —523 €23 —spe 1? 0 cia 0 0 1 


The advantage of this parametrization is that the three mixing angles are of different 
orders of magnitude. In fact, from experiment we know that s12 = A, $23 ~ 0(42) 
and 513 ~ o(A3), where A = sin 6c is the sine of the Cabibbo angle, and, as order 
of magnitude, s;; can be expressed in terms of small powers of X. More precisely, 
following Wolfenstein [15] one can set: 


$12 =A, 5233 = Ad’, size? = AX (p — in) (3.70) 


As a result, by neglecting terms of higher order in A one can write down: 


Vua Vus Vub 5 A A2? (p — in) 
V = | Vea Ves Veb dx —A — x AM + olf). 
Via Vis Vib AX (1 — p — ig) —AX? 1 


(3.71) 
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It has become customary to make the replacement p, n — p, 7 with: 


T ~ (RZ 2 
pM AE (o =i Ad c A^/2 ...). (3.72) 


Present values of the CKM parameters as obtained from experiment are [16] [17] (a 
survey of the current status of the CKM parameters can also be found in Ref. [5]): 
à = 0.2258 + 0.0014 
A = 0.818 + 0.016 
p = 0.164 + 0.029; »=0.340+0.017 (3.73) 


A more detailed discussion of the experimental data is given in Chap. 10. 

In the SM the non vanishing of the 7 parameter (related to the phase in 
Eqs. 3.69 and 3.70) is the only source of CP violation. Unitarity of the CKM matrix 
V implies relations of the form Y. Vba V% = ôbc. In most cases these relations 
do not imply particularly instructive constraints on the Wolfenstein parameters. But 
when the three terms in the sum are of comparable magnitude we get interesting 
information. The three numbers which must add to zero form a closed triangle in the 
complex plane, with sides of comparable length. This is the case for the t-u triangle 
(unitarity triangle) shown in Fig. 3.6 (or, what is equivalent in first approximation, 
for the d-b triangle): 


Via Via + Vis Vis + Vig Vip = 0 (3.74) 


All terms are of order 43. For n = 0 the triangle would flatten down to vanishing 
area. In fact the area of the triangle, J of order J ~ nA7A°, is the Jarlskog invariant 
[18] (Gits value is independent of the parametrization). In the SM all CP violating 
observables must be proportional to J, hence to the area of the triangle or to n. A 
direct and by now very solid evidence for J non vanishing is obtained from the 
measurements of e and e' in K decay. Additional direct evidence is being obtained 
from the experiments on B decays at beauty factories and at the TeVatron where the 
angles £8 (the most precisely measured), œ and y have been determined. Together 
with the available information on the magnitude of the sides all the measurements 


Fig. 3.6 The unitarity 
triangle corresponding to 
Eq. (3.74) 
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Fig. 3.7 Box diagrams d W $ d d 

describing K? — K? mixing 

at the quark level at 1-loop he | W 
uGt 


are in good agreement with the predictions from the SM unitary triangle [16, 17] 
(see Chap. 10). 

As we have discussed, due to the GIM mechanism, there are no flavour changing 
neutral current (FCNC) transitions at tree level in the SM. Transitions with |A F| = 
1,2 are induced at one loop level. In particular, meson mixing, i.e. M — M off 
diagonal | AF| = 2 mass matrix elements (with M = K, D or B neutral mesons), 
are obtained from box diagrams. For example, in the case of K? — K mixing the 
relevant transition is Sd — sd (see Fig. 3.7). In the internal quark lines all up-type 
quarks are exchanged. In the amplitude, two vertices and the connecting propagator 
(with virtual four momentum p,,) at one side contribute a factor (u; = u, c, t): 


| 
ex 
val 
ve 


Foim = 2 Vi us — my — — Vud, (3.75) 


which, in the limit of equal mui, is clearly vanishing due to the unitarity of the CKM 
matrix V. Thus the result is proportional to mass differences. For K? — K? mixing 
the contribution of virtual u quarks is negligible due to the small value of m, and the 
contribution of the t quark is also small due to the mixing factors Vt V;4 ~ 0(A7A°). 
The dominant c quark contribution to the real part of the box diagram quark-level 
amplitude is approximately of the form (see, for example, [19]): 


2 


G 2 2 As=2 
ReHpox = Tera ne Re(V5 Vea) mos , (3.76) 


where 54 ~ 0.85 is a QCD correction factor and oAs=2 = =d; L YuSL SL yy dr is the 
4-quark dimension six relevant operator. To obtain the K? — K mixing its matrix 
element between meson states must be taken which is parametrized in terms of a 
“Bx parameter" which is defined in such a way that Bg = 1 for vacuum state 
insertion between the two currents: 


E 16 
ioo s z fm Bk (3.77) 


where fx ~ 113MeV is the kaon pseudoscalar constant. Clearly to the charm 
contribution in Eq. (3.76) non perturbative additional contributions must be added, 
some of them of o(m5, / m2), because the smallness of m, makes a completely 
partonic dominance inadequate. In particular, By is best evaluated by QCD lattice 
simulations. In Eq. (3.76) the factor o(m? / mi) is the “GIM suppression" factor 
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Fig. 3.8 Examples of |A F| = 1 transitions at the quark level at 1-loop: (a) Diagram fora Z —^ tc 
vertex, (b) b — s y, (c) a "penguin" diagram for b > s eT e7 


(1/ mns is hidden in Gr according to Eq. (3.19)). For B mixing the dominant 
contribution is from the t quark. In this case, the partonic dominance is more realistic 
and the GIM factor o(m? / my) is actually larger than one. 

All sorts of transitions with | A F| = 1 are also induced at loop level. For example, 
an effective vertex Z — tc, which does not exist at tree level, is generated at 1-loop 
(see Fig. 3.8). Similarly, transitions involving photons or gluons are also possible, 
like t — cgorb— s y (Fig.3.8) or b — s g. For light fermion exchange 
in the loop the GIM suppression is also effective in |AF| — 1 amplitudes. For 
example, analogous leptonic transitions like u — e y ort — u y also exist but 
are extremely small in the SM because the tiny neutrino masses enter in the GIM 
suppression factor. But new physics effects could well make these rare processes 
accessible to experiments in the near future. The external Z, photon or gluon can be 
attached to a pair of light fermions, giving rise to an effective four fermion operator, 
as in “penguin diagrams" like the one shown in Fig.3.8 for b — s [*1^. The 
inclusive rate B — X; y with X, a hadronic state containing a unit of strangeness 
corresponding to an s-quark, has been precisely measured. The world average result 
for the branching ratio with E, > 1.6 GeV is [5]: 


B(B > X; y)exp = (3.55 0.2610 ^. (3.78) 


The theoretical prediction for this inclusive process is to a large extent free of 
uncertainties from hadronisation effects and is accessible to perturbation theory as 
the b-quark is heavy enough. The most complete result at order a2 is at present [20] 
(and refs. therein): 


B(B — X; y)ih = (2.98 £0.26) 10-4 . (3.79) 


Note that the theoretical value has recently become smaller than the experimental 
value. The fair agreement between theory and experiment imposes stringent con- 
straints on possible new physics effects. 
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3.7 Neutrino Masses 


In the minimal version of the SM the right handed neutrinos v;r, which have no 
gauge interactions, are not present at all. With no ve no Dirac mass is possible 
for neutrinos. If lepton number conservation is also imposed, then no Majorana 
mass is allowed either and, as a consequence, all neutrinos are massless. But, at 
present, from neutrino oscillation experiments (see Chapter 11 of the present work), 
we know that at least 2 out of the 3 known neutrinos have non vanishing masses: 
the two mass squared differences measured from solar (Am?) and atmospheric 
oscillations (Am24) are given by Am?, ~ 8 10? eV? and Am}, ~ 2.5 10? 
[21]. The absolute values of the masses are very small, with an upper limit of a 
fraction of eV, obtained from laboratory experiments (tritium £ decay near the end 
point: m, < 2 eV [5], absence of visible neutrinoless double 8 decay : |mee| S 
0.3—0.7 eV (mee is a combination of neutrino masses; for a review, see, for example 
[22]) and from cosmological observations: m, < 0.1 — 0.7 eV (depending on the 
cosmological model assumptions) [23]. If v;ę are added to the minimal model and 
lepton number is imposed by hand, then neutrino masses would in general appear as 
Dirac masses, generated by the Higgs mechanism, like for any other fermion. But, 
for Dirac neutrinos, to explain the extreme smallness of neutrino masses, one should 
allow for very small Yukawa couplings. However, we stress that, in the SM, baryon 
B andlepton L number conservation, which are not guaranteed by gauge symmetries 
(as is the case for the electric charge Q), are understood as "accidental" symmetries, 
due to the fact that, out of the SM fields, it is not possible to construct gauge invariant 
operators which are renormalizable (i.e. of operator dimension d < 4) and violate 
B and/or L. In fact the SM lagrangian should contain all terms allowed by gauge 
symmetry and renormalizability. The most general renormalizable lagrangian, built 
from the SM fields, compatible with the SM gauge symmetry, in absence of vig, is 
automatically B and L conserving. But in presence of vig, this is no more true and 
the right handed Majorana mass term is allowed: 


MRR = Vig MijVjR = vigCMijVjR ; (3.80) 


where vip = C V. is the charge conjugated neutrino field and C is the charge 
conjugation matrix in Dirac spinor space. The Majorana mass term is an operator 
of dimension d = 3 with AL = 2. Since the vj; are gauge singlets the Majorana 
mass Mgm is fully allowed by the gauge symmetry and a coupling with the Higgs is 
not needed to generate this type of mass. As a consequence, the entries of the mass 
matrix Mj; do not need to be of the order of the EW symmetry breaking scale v and 
could be much larger. If one starts from the Dirac and RR Majorana mass terms for 
neutrinos, the resulting mass matrix, in the L, R space, has the form: 


m, = | " 2 (3.81) 
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where mp and M are the Dirac and Majorana mass matrices (M is the matrix Mj; 
in Eq. (3.80)). The corresponding eigenvalues are three very heavy neutrinos with 
masses of order M and three light neutrinos with masses 


my, = —mLM imp ; (3.82) 


which are possibly very small if M is large enough. This is the see-saw mechanism 
for neutrino masses [24]. Note that if no vjr exist a Majorana mass term could 
still be built out of vjz. But vjz have weak isospin 1/2, being part of the left 
handed lepton doublet /. Thus, the left handed Majorana mass term has total weak 
isospin equal to one and needs two Higgs fields to make a gauge invariant term. The 
resulting mass term: 


Os = AT XjljHH/M , (3.83) 


with M a large scale (apriori comparable to the scale of Mrr) and A a dimensionless 
coupling generically of o(1), is a non renormalizable operator of dimension 5. The 
corresponding mass terms are of the order m, ~ àv? / M, hence of the same generic 
order of the light neutrino masses from Eq. (3.82). 

In conclusion, neutrino masses are believed to be small because neutrinos are 
Majorana particles with masses inversely proportional to the large scale M of energy 
where L non conservation is induced. It is interesting that the observed magnitudes 
of the mass squared splittings of neutrinos are well compatible with a scale M 
remarkably close to the Grand Unification scale, where in fact L non conservation 
is naturally expected. 

In the previous Section we have discussed flavour mixing for quarks. But, clearly, 
given that non vanishing neutrino masses have been established, a similar mixing 
matrix is also introduced in the leptonic sector, but will not be discussed here (see 
Chapter 11). 


3.8 Renormalization of the Electroweak Theory 


The Higgs mechanism gives masses to the Z, the W~ and to fermions while the 
lagrangian density is still symmetric. In particular the gauge Ward identities and the 
symmetric form of the gauge currents are preserved. The validity of these relations 
is an essential ingredient for renormalizability. In the previous Sections we have 
specified the Feynman vertices in the “unitary” gauge where only physical particles 
appear. However, as discussed in Chap. 2, in this gauge the massive gauge boson 
propagator would have a bad ultraviolet behaviour: 


(3.84) 
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A formulation of the standard EW theory with good apparent ultraviolet behaviour 
can be obtained by introducing the renormalizable or Rg gauges, in analogy with 
the abelian case discussed in detail in Chap. 2. One parametrizes the Higgs doublet 


as: 
^) i F S —iw" 
E E f = siz}; (3.85) 
i (5 $3 + ida Uu = 
and similarly for $^, where w~ appears. The scalar fields w+ and z are the pseudo 


Goldstone bosons associated with the longitudinal modes of the physical vector 
bosons W= and Z. The Rg gauge fixing lagrangian has the form: 


1 1 1 
A£gr = lo^ Wa — Emw wl? — 2; Q^ Za - nmz) - 55 NAW . (3.86) 


The W~ and Z propagators, as well as those of the scalars w~ and z, have exactly 
the same general forms as for the abelian case in Eqs. (67)-(69) of Chap. 2, with 
parameters £ and n, respectively (and the pseudo Goldstone bosons w~ and z have 
masses ám y and nmz). In general, a set of associated ghost fields must be added, 
again in direct analogy with the treatment of Rg gauges in the abelian case of 
Chap. 2. The complete Feynman rules for the standard EW theory can be found 
in a number of textbooks (see, for example, [25]). 

The pseudo Goldstone bosons w~ and z are directly related to the longitudinal 
helicity states of the corresponding massive vector bosons W~ and Z. This 
correspondence materializes in a very interesting “equivalence theorem”: at high 
energies of order E the amplitude for the emission of one or more longitudinal gauge 
bosons Vz (with V = W, Z) becomes equal (apart from terms down by powers of 
m y / E) to the amplitude where each longitudinal gauge boson is replaced by the 
corresponding Goldstone field w- or z [26]. For example, consider top decay with 
a longitudinal W in the final state: t — bwr . The equivalence theorem asserts that 
we can compute the dominant contribution to this rate from the simpler t > bwt 
matrix element: 


Tt > bWF) 2T (t > bw*)[1 + o(my/m2)] . (3.87) 


In fact one finds: 


h2 G 3 
T(t > bw*) = —m, = EM 


= 3.88 
325 | Br? uin 


where h; = m;/v is the Yukawa coupling of the top quark (numerically very close 
to 1), and we used 1/v? —242Gr (see Eq. (3.54)). If we compare with Eq. (3.34), 
we see that this expression coincides with the total top width (i.e. including all 
polarizations for the W in the final state), computed at tree level, apart from terms 
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down by powers of o(myy / m?). In fact, the longitudinal W is dominant in the final 
state because h; >> g*. Similarly the equivalence theorem can be applied to find 
the dominant terms at large ./s for the crosssection e^ e^ > wt W; , or the leading 
contribution in the limit my >> my to the width for the decay l'(H — VV). 

The formalism of the Re gauges is also very useful in proving that spontaneously 
broken gauge theories are renormalizable. In fact, the non singular behaviour of 
propagators at large momenta is very suggestive of the result. Nevertheless to 
prove it is by far not a simple matter. The fundamental theorem that in general a 
gauge theory with spontaneous symmetry breaking and the Higgs mechanism is 
renormalizable was proven by 't Hooft and Veltman [27, 28]. 

For a chiral theory like the SM an additional complication arises from the 
existence of chiral anomalies. But this problem is avoided in the SM because the 
quantum numbers of the quarks and leptons in each generation imply a remarkable 
(and, from the point of view of the SM, mysterious) cancellation of the anomaly, 
as originally observed in Ref.[29]. In quantum field theory one encounters an 
anomaly when a symmetry of the classical lagrangian is broken by the process of 
quantization, regularization and renormalization of the theory. Of direct relevance 
for the EW theory is the Adler-Bell-Jackiw (ABJ) chiral anomaly [30]. The classical 
lagrangian of a theory with massless fermions is invariant under a U(1) chiral 
transformations ys = e/Y5? y. The associated axial Noether current is conserved 
at the classical level. But, at the quantum level, chiral symmetry is broken due to the 
ABJ anomaly and the current is not conserved. The chiral breaking is produced by a 
clash between chiral symmetry, gauge invariance and the regularization procedure. 

The anomaly is generated by triangular fermion loops with one axial and two 
vector vertices (Fig. 3.9). For example, for the Z the axial coupling is proportional 
to the third component of weak isospin t5, while the vector coupling is proportional 
to a linear combination of t3 and the electric charge Q. Thus in order for the chiral 
anomaly to vanish all traces of the form tr(t3 QQ}, tr{t3t3 Q}, tr {t3t3t3} (and also 
tr{t,t_t3} when charged currents are also included) must vanish, where the trace 
is extended over all fermions in the theory that can circulate in the loop. Now all 
these traces happen to vanish for each fermion family separately. For example take 
tr(t3 Q Q}. In one family there are, with t3 = +1/2, three colours of up quarks with 


Fig. 3.9 Triangle diagram 
that generates the ABJ 
anomaly 
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charge Q = +2/3 and one neutrino with Q = 0 and, with #3 = —1/2, three colours 
of down quarks with charge Q = —1/3 and one/~ with Q = —1. Thus we obtain 
trít3QQ) = 1/234/9 — 1/23:1/9 — 1/21 = 0. This impressive cancellation 
suggests an interplay among weak isospin, charge and colour quantum numbers 
which appears as a miracle from the point of view of the low energy theory but is in 
fact understandable from the point of view of the high energy theory. For example, 
in Grand Unified Theories (GUTs) (for reviews, see, for example, [31]) there are 
similar relations where charge quantization and colour are related: in the five of 
SU(5) we have the content (d, d, d, e+, v) and the charge generator has a vanishing 
trace in each SU(5) representation (the condition of unit determinant, represented by 
the letter S in the SU(5) group name, translates into zero trace for the generators). 
Thus the charge of d quarks is —1/3 of the positron charge because there are three 
colours. A whole family fits perfectly in one 16 of SO(10) which is anomaly free. 
So GUTs can naturally explain the cancellation of the chiral anomaly. 

An important implication of chiral anomalies together with the topological 
properties of the vacuum in non abelian gauge theories is that the conservation of the 
charges associated to baryon (B) and lepton (L) numbers is broken by the anomaly 
[32], so that B and L conservation is actually violated in the standard electroweak 
theory (but B-L remains conserved). B and L are conserved to all orders in the 
perturbative expansion but the violation occurs via non perturbative instanton effects 
[33] (the amplitude is proportional to the typical non perturbative factor exp —c/g7, 
with c a constant and g the SU(2) gauge coupling). The corresponding effect is 
totally negligible at zero temperature T, but becomes relevant at temperatures close 
to the electroweak symmetry breaking scale, precisely at T ~ o(TeV). The non 
conservation of B+L and the conservation of B—L near the weak scale plays a role 
in the theory of baryogenesis that quantitatively aims at explaining the observed 
matter antimatter asymmetry in the Universe (for a recent review, see, for example, 
[34]; see also Chap. 9). 


3.9 QED Tests: Lepton Anomalous Magnetic Moments 


The most precise tests of the electroweak theory apply to the QED sector. Here 
we discuss some recent developments. The anomalous magnetic moments of the 
electron and of the muon are among the most precise measurements in the whole 
of physics. The magnetic moment ji and the spin S are related by ù = — ges /2m, 
where g is the gyromagnetic ratio (g — 2 for a pointlike Dirac particle). The quantity 
a = (g — 2)/2 measures the anomalous magnetic moment of the particle. Recently 
there have been new precise measurements of a, and a,, for the electron [35] and 
the muon [36]: 


*P — 11596521808.5(7.6) 10- P, at^? = 11659208.0(6.3) 310-19. 
(3.89) 
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Fig. 3.10 The hadronic 
contributions to the 
anomalous magnetic moment: 
vacuum polarization (left) 
and light by light scattering 
(right) 


The theoretical calculations in general contain a pure QED part plus the sum of 
hadronic and weak contribution terms: 


a= aq QE? 4 qhadronic y. queak aLa y qb adronic 4 queak (3.90) 


The QED part has been computed analytically fori = 1,2,3, while fori = 4 
there is a numerical calculation with an error (see, for example, [38] and refs 
therein). Some terms for i = 5 have also been estimated for the muon case. The 
hadronic contribution is from vacuum polarization insertions and from light by light 
scattering diagrams (see Fig. 3.10). The weak contribution is from W or Z exchange. 

For the electron case the weak contribution is essentially negligible and the 
hadronic term (qiterante ~ (16.71 + 0.19):107 1?) does not introduce an important 
uncertainty. As a result this measurement can be used to obtain the most precise 
determination of the fine structure constant [37]: 


-1 ~ 137.035999710(96) , (3.91) 


with an uncertainty about 10 times smaller than the previous determination. 
However, very recently a theoretical error in the o^ terms was corrected [39]. As a 
result the value of a~! in Eq. (3.91) is shifted by —6.41180(73) 1077 (about 7 a''s). 
This change has a minor impact in the following discussion of the muon (g — 2). 

In the muon case the experimental precision is less by about three orders of 
magnitude, but the sensitivity to new physics effects is typically increased by a 
factor (m,,/ me)” ~ 410^ (one mass factor arises because the effective operator 
needs a chirality flip and the second one is because, by definition, one must factor 
out the Bohr magneton e/2m). From the theory side, the QED term (using the value 
of a from a, in Eq. (3.91)), and the weak contribution are affected by small errors 
and are given by (all theory number are taken here from the review [40]) 


aQFP = (116584718.09-E 1.610, — avec = (154-2210! — (3.92) 


The dominant ambiguities arise from the hadronic term. The lowest order (LO) 
vacuum polarization contribution can be evaluated from the measured cross sections 
ine*e^ — hadrons at low energy via dispersion relations (the largest contribution 
is from the x7 final state), with the result aZ?-10 1! = 6909 + 44. The higher 
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order (HO) vacuum polarization contribution (from 2-loop diagrams containing an 
hadronic insertion) is given by: afl 0.10-!! = —98 + 1. The contribution of the 
light by light (LbL) scattering diagrams is estimated to be: aZ^.-10!! = 120435. 
Adding the above contributions up the total hadronic result is reported as: 


a arene (603i + Soy 10 (3.93) 
p 


At face value this would lead to a 3.30 deviation from the experimental value ap 
in Eq. (3.89): 


aP — gihe) — (275 + 84) 107". (3.94) 
However, the error estimate on the LbL term, mainly a theoretical uncertainty, is 
not compelling, and it could well be somewhat larger (although probably not by as 
much as to make the discrepancy to completely disappear). Another puzzle is the 
fact that, using the conservation of the vector current (CVC) and isospin invariance, 
which are well established tools at low energy, aL O can also be evaluated from t 
decays. But the results on the hadronic contribution from e* e^ and from r decay, 
nominally of comparable accuracy, do not match well, and the discrepancy would be 
much attenuated if one takes the t result [41]. Since it is difficult to find a theoretical 
reason for the e*e^ vs t difference, one must conclude that there is something 
which is not understood either in the data or in the assessment of theoretical errors. 
The prevailing view is to take the et e^ determination as the most directly reliable, 
which leads to Eq. (3.94), but doubts certainly remain. Finally, we note that, given 
the great accuracy of the a, measurement and the relative importance of the non 
QED contributions, it is not unreasonable that a first signal of new physics can 
appear in this quantity. 


3.10 Large Radiative Corrections to Electroweak Processes 


Since the SM theory is renormalizable higher order perturbative corrections can 
be reliably computed. Radiative corrections are very important for precision EW 
tests. The SM inherits all successes of the old V-A theory of charged currents 
and of QED. Modern tests have focussed on neutral current processes, the W 
mass and the measurement of triple gauge vertices. For Z physics and the W 
mass the state of the art computation of radiative corrections include the complete 
one loop diagrams and selected dominant two loop corrections. In addition some 
resummation techniques are also implemented, like Dyson resummation of vacuum 
polarization functions and important renormalization group improvements for large 
QED and QCD logarithms. We now discuss in more detail sets of large radiative 
corrections which are particularly significant (for reviews of radiative corrections 
for LEP1 physics, see, for example: [42]). 
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Even leaving aside QCD corrections, a set of important quantitative contributions 
to the radiative corrections arise from large logarithms [e.g. terms of the form 
(a/a In (mz/m f1))" where fu is a light fermion]. The sequences of leading and 
close-to-leading logarithms are fixed by well-known and consolidated techniques (6 
functions, anomalous dimensions, penguin-like diagrams, etc.). For example, large 
logarithms from pure QED effects dominate the running of o from me, the electron 
mass, up to mz. Similarly large logarithms of the form [a@/z In (mz/j)]”" also 
enter, for example, in the relation between sin? Ow at the scales mz (LEP, SLC) 
and u (e.g. the scale of low-energy neutral-current experiments). Also, large logs 
from initial state radiation dramatically distort the line shape of the Z resonance as 
observed at LEP! and SLC and this effect was accurately taken into account for 
the measurement of the Z mass and total width. The experimental accuracy on mz 
obtained at LEP1 is ómz = +2.1 MeV (see Chap. 6). Similarly, a measurement of 
the total width to an accuracy ôl’ = +2.3 MeV has been achieved. The prediction of 
the Z line-shape in the SM to such an accuracy has posed a formidable challenge to 
theory, which has been successfully met. For the inclusive process ete~ — f f X, 
with f z e (for a concise discussion, we leave Bhabha scattering aside) and X 
including y's and gluons, the physical cross-section can be written in the form of a 
convolution [42]: 


$ 
Bis f did e. (3.95) 


£0 


where ô is the reduced cross-section, and G(z,s) is the radiator function that 
describes the effect of initial-state radiation; 6 includes the purely weak corrections, 
the effect of final-state radiation (of both y’s and gluons), and also non-factorizable 
terms (initial- and final-state radiation interferences, boxes, etc.) which, being small, 
can be treated in lowest order and effectively absorbed in a modified ô. The radiator 
G(z, s) has an expansion of the form 


G(z, s) = &(1 — z) c a/z (ai L + ao) + (a/2) (aL? +a L +.ay9) + ... + 


+ (a/nY' Y anil! , (3.96) 


i=0 


where L = In s/ m? c 24.2 for J/s ~ mz. All first- and second-order terms 
are known exactly. The sequence of leading and next-to-leading logs can be 
exponentiated (closely following the formalism of structure functions in QCD). For 
mz ^: 91 GeV, the convolution displaces the peak by +110 MeV, and reduces it 
by a factor of about 0.74. The exponentiation is important in that it amounts to an 
additional shift of about 14 MeV in the peak position with respect to the one loop 
radiative correction. 

Among the one loop EW radiative corrections, a very remarkable class of 
contributions are those terms that increase quadratically with the top mass. The 
sensitivity of radiative corrections to m; arises from the existence of these terms. The 
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quadratic dependence on m; (and on other possible widely broken isospin multiplets 
from new physics) arises because, in spontaneously broken gauge theories, heavy 
virtual particles do not decouple. On the contrary, in QED or QCD, the running 
of a and o, at a scale Q is not affected by heavy quarks with mass M > Q. 
According to an intuitive decoupling theorem [43], diagrams with heavy virtual 
particles of mass M can be ignored at Q «& M provided that the couplings do not 
grow with M and that the theory with no heavy particles is still renormalizable. 
In the spontaneously broken EW gauge theories both requirements are violated. 
First, one important difference with respect to unbroken gauge theories is in the 
longitudinal modes of weak gauge bosons. These modes are generated by the Higgs 
mechanism, and their couplings grow with masses (as is also the case for the 
physical Higgs couplings). Second the theory without the top quark is no more 
renormalizable because the gauge symmetry is broken as the (t,b) doublet would 
not be complete (also the chiral anomaly would not be completely cancelled). 
With the observed value of m; the quantitative importance of the terms of order 
G pm? /An?4/2 is substancial but not dominant (they are enhanced by a factor 
m? / mi, ~ 5 with respect to ordinary terms). Both the large logarithms and the 
G p m? terms have a simple structure and are to a large extent universal, i.e. common 
to a wide class of processes. In particular the G pm? terms appear in vacuum 
polarization diagrams which are universal (virtual loops inserted in gauge boson 
internal lines are independent of the nature of the vertices on each side of the 
propagator) and in the Z — bb vertex which is not. This vertex is specifically 
sensitive to the top quark which, being the partner of the b quark in a doublet, runs in 
the loop. Instead all types of heavy particles could in principle contribute to vacuum 
polarization diagrams. The study of universal vacuum polarization contributions, 
also called "oblique" corrections, and of top enhanced terms is important for 
an understanding of the pattern of radiative corrections. More in general, the 
important consequence of non decoupling is that precision tests of the electroweak 
theory may apriori be sensitive to new physics even if the new particles are too 
heavy for their direct production, but aposteriori no signal of deviation has clearly 
emerged. 

While radiative corrections are quite sensitive to the top mass, they are unfortu- 
nately much less dependent on the Higgs mass. If they were sufficiently sensitive 
by now we would precisely know the mass of the Higgs. But the dependence 
of one loop diagrams on my is only logarithmic: ~ G pm? log(m?, / m3, ). 
Quadratic terms ~ Gis, only appear at two loops [44] and are too small to 
be detectable. The difference with the top case is that the splitting m? — m; 
is a direct breaking of the gauge symmetry that already affects the 1- loop 
corrections, while the Higgs couplings are “custodial” SU(2) symmetric in lowest 
order. 
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3.11 Electroweak Precision Tests in the SM and Beyond 


For the analysis of electroweak data in the SM one starts from the input parameters: 
as is the case in any renormalizable theory, masses and couplings have to be 
specified from outside. One can trade one parameter for another and this freedom is 
used to select the best measured ones as input parameters. Some of them, a, G p and 
mz, are very precisely known, as we have seen, some other ones, m f; ghi» Mt and 
as (mz) are less well determined while mg is largely unknown. Among the light 
fermions, the quark masses are badly known, but fortunately, for the calculation 
of radiative corrections, they can be replaced by a(mz), the value of the QED 
running coupling at the Z mass scale. The value of the hadronic contribution to 
the running, embodied in the value of Aa, (m2) (see Table 3.1, [8] ) is obtained 
through dispersion relations from the data on et e^ — hadrons at moderate centre- 
of-mass energies. From the input parameters one computes the radiative corrections 
to a sufficient precision to match the experimental accuracy. Then one compares the 
theoretical predictions with the data for the numerous observables which have been 
measured [45], checks the consistency of the theory and derives constraints on mz, 
a, (mz) and mp. A detailed discussion of all experimental aspects of precision tests 
of the EW theory is presented in Chap. 6. 
The basic tree level relations: 


2 42 2 
= —, sinf Ow = e^ = 4ra 3.97 
8miy a è v ( 
can be combined into 
sin? 0 i (3.98) 
w = ————— : 
V2G rmi, 


Always at tree level, a different definition of sin? Ow is from the gauge boson 
masses: 


2 2 


m m 
— Z — =m=1 = sin’ oy =1-—¥ (3.99) 
mz cos Ow m 


where po = | assuming that there are only Higgs doublets. The last two relations 
can be put into the convenient form 


m2 m2 Ta 
(c E Tur e (3.100) 
m2, m?, J/2G p m?, 
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Beyond tree level, these relations are modified by radiative corrections: 


a- aa za(mz) 1 
m? m?, JG pm, ]— Arw 
w 
——— =1+4+A 3.101 
m2, cos? ðw Pm ( ) 


The Z and W masses are to be precisely defined in terms of the pole position in 
the respective propagators. Then, in the first relation the replacement of œ with the 
running coupling at the Z mass a (mz) makes Arw completely determined at 1-loop 
by purely weak corrections (G r is protected from logarithmic running as an indirect 
consequence of (V-A) current conservation in the massless theory). This relation 
defines Arw unambigously, once the meaning of o (mz) is specified (for example, 
M S). On the contrary, in the second relation Apm depends on the definition of 
sin? Ow beyond the tree level. For LEP physics sin? Ow is usually defined from the 
Z — u™ u” effective vertex. At the tree level the vector and axial-vector couplings 
gy and gh are given in Eqs. (3.29). Beyond the tree level a corrected vertex can be 
written down in terms of modified effective couplings. Then sin? Ow = sin? er f is 
in general defined through the muon vertex: 


gbegi = 1—4sin* bere 


sin? eff = (1+ Ak)sĝ, seca = 


wr 1 
ga = 4M Ap) (3.102) 


We see that s and cå are “improved” Born approximations (by including the 
running of o) for sin? 8e ff and cos? 0, ff. Actually, since in the SM lepton 
universality is only broken by masses and is in agreement with experiment within 
the present accuracy, in practice the muon channel can be replaced with the average 
over charged leptons. 

We can write a symbolic equation that summarizes the status of what has been 
computed up to now for the radiative corrections (we list some recent work on each 
item from where older references can be retrieved) Arw [46], Ap [47] and Ak [48]: 


m4 2 
m 
Arw, Ap, Ak = g^ ile a ha pe CRE F +g +. 
my my my 
(3.103) 


The meaning of this relation is that the one loop terms of order g? are completely 
known, together with their first order QCD corrections (the second order QCD 
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corrections are only estimated for the g? terms not enhanced by m? / m5), and the 
terms of order g^ enhanced by the ratios m? / miy or m? / mi, are also known. 

In the SM the quantities Arw, Ap, Ak, for sufficiently large m;, are all 
dominated by quadratic terms in m; of order G pm. The quantity Ap» is not 
independent and can expressed in terms of them. As new physics can more easily be 
disentangled if not masked by large conventional m; effects, it is convenient to keep 
Ap while trading Arw and Ak for two quantities with no contributions of order 
G pm. One thus introduces the following linear combinations (epsilon parameters) 
[49]: 


€ = Ap, 
2 
Sí Ar 
e = gap + 2 — 2s$ Ak, 
Co — So 
EE 2 2 
63 = cAp + (cd — sS) AK. (3.104) 


The quantities €2 and €3 no longer contain terms of order G pm? but only logarithmic 
terms in m;. The leading terms for large Higgs mass, which are logarithmic, are 
contained in e; and €3. To complete the set of top-enhanced radiative corrections 
one adds e; defined from the loop corrections to the Zbb vertex. One modifies gł 
and g^ as follows: 


TEN jee) 
SA > 2 2 b) 
b . 2 
1 — 4/3 sinf beff + € 
is = I4 sm depen (3.105) 
gA 1+ €p 


€p can be measured from Rp = I'(Z > bb)/ I'(Z — hadrons) (see Table 3.1). 
This is clearly not the most general deviation from the SM in the Z — bb vertex 
but ej is the quantity where the large m; corrections are located in the SM. Thus, 
summarizing, in the SM one has the following "large" asymptotic contributions: 


E€ = = aug um Own oa 


Grm2 m 
ume MESS. area 
Qn2/2 mz 
= Grm mH Grm, m, 
e = ————]ln— — ————-ln——.... 
12322. mz 622,/2 mz 
Grm? 


€) = Peris (3.106) 
? An?24/2 
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The e; parameters vanish in the limit where only tree level SM effects are kept 
plus pure QED and/or QCD corrections. So they describe the effects of quantum 
corrections (i.e. loops) from weak interactions. A similar set of parameters are the 
S, T, U parameters [50]: the shifts induced by new physics on S, T and U are 
proportional to those induced on e3, e; and e», respectively. In principle, with no 
model dependence, one can measure the four e; from the basic observables of LEP 
physics F(Z > pty), Afp and Rp on the Z peak plus mw. With increasing 
model dependence, one can include other measurements in the fit for the e;. For 
example, use lepton universality to average the u with the e and x final states, or 
include all lepton asymmetries and so on. The present experimental values of the ej, 
obtained from a fit of all LEP1-SLD measurements plus my, are given by The LEP 
Electroweak Working Group [8]: 


ej 10 2 5.41.0, 65102 —89412, 
e3 "10° = 5.34 + 0.94, ej 10? = —5.0 + 1.6. (3.107) 


Note that the € parameters are of order a few in 107? and are known with an accuracy 
in the range 15-30%. As discussed in the next Section, these values are in agreement 
with the SM with a light Higgs. All models of new physics must be compared with 
these findings and pass this difficult test. 


3.12 Results of the SM Analysis of Precision Tests 


The electroweak Z pole measurements, combining the results of all the experiments, 
are summarised in Table 3.1. The various asymmetries determine the effective 
electroweak mixing angle for leptons with highest sensitivity. The weighted average 
of these results, including small correlations, is: 


sin? eff = 0.23153 + 0.00016, (3.108) 


Note, however, that this average has a x? of 11.8 for 5 degrees of freedom, 
corresponding to a probability of a few 96. The x? is pushed up by the two most 
precise measurements of sin? Oeff, namely those derived from the measurements 
of A; by SLD, dominated by the left-right asymmetry A9 p» and of the forward- 


backward asymmetry measured in bb production at LEP, Ave , which differ by about 
308. 

We now discuss fitting the data in the SM. One can think of different types 
of fit, depending on which experimental results are included or which answers 
one wants to obtain. For example, in Table 3.2 we present in column 1 a fit of 
all Z pole data plus mw and Iw (this is interesting as it shows the value of m; 
obtained indirectly from radiative corrections, to be compared with the value of 
m, measured in production experiments), in column 2 a fit of all Z pole data plus 
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oe ST of Observable Measurement SM fit 

oh et Q? [8] mz [GeV] 91.1875 0.0021 | 91.1875 
Tz [GeV] 2.4952 + 0.0023 2.4957 
c? [nb] 41.540 + 0.037 41.477 
R? 20.767 0.025 20.744 
AF p9! 0.01714 + 0.00095 0.01645 
A; (SLD) 0.1513 + 0.0021 0.1481 
Aj (Pr) 0.1465 + 0.0032 | 0.1481 
R? 0.21629 + 0.00066 | 0.21586 
R? 0.1721 + 0.0030 0.1722 
AM 0.0992 + 0.0016 | 0.1038 
FER 0.0707 + 0.0035 0.0742 
Ap 0.923 + 0.020 0.935 
Ac 0.670 + 0.027 0.668 
sin? Bess (Q) 0.2324 + 0.0012 0.2314 
mw [GeV] 80.398 + 0.025 80.374 - 
Tw [GeV] 2.140 + 0.060 | 2.091 
m; [GeV (pp) 170.9 + 1.8 [171.3 
Aa® (m?) 0.02758 + 0.00035 | 0.02768 


The first block shows the Z-pole measurements. The second 
block shows additional results from other experiments: the 
mass and the width of the W boson measured at the Tevatron 
and at LEP-2, the mass of the top quark measured at the 
Tevatron, and the contribution to œ of the hadronic vacuum 
polarization. The SM fit results are derived from the SM 
analysis of these results 


m, (here it is mw which is indirectly determined), and, finally, in column 3 a fit 
of all the data listed in Table 3.1 (which is the most relevant fit for constraining 
my). From the fit in column 1 of Table 3.2 we see that the extracted value of 
m; is in good agreement with the direct measurement (see Table 3.1). Similarly 
we see that the experimental measurement of my in Table 3.1 is larger by about 
one standard deviation with respect to the value from the fit in column 2. We 
have seen that quantum corrections depend only logarithmically on mg. In spite 
of this small sensitivity, the measurements are precise enough that one still obtains 
a quantitative indication of the mass range. From the fit in column 3 we obtain: 
logio mH (GeV) = 1.88 + 0.16 (or my = 76754 GeV). This result on the Higgs 
mass is particularly remarkable. The value of log; mg (GeV) is compatible with 
the small window between ~2 and ~3 which is allowed, on the one side, by the 
direct search limit (my > 114 GeV from LEP-2 [8]), and, on the other side, by the 
theoretical upper limit on the Higgs mass in the minimal SM, my < 600— 800 GeV 
[51]. 

Thus the whole picture of a perturbative theory with a fundamental Higgs is well 
supported by the data on radiative corrections. It is important that there is a clear 
indication for a particularly light Higgs: at 9596 c.l. my 182 GeV (including 


^ 
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Table 3.2 Standard Model fits of electroweak data [8] 


Fit 1 |2 3 

Measurements mw m; Mte, MW 

m, (GeV) 178.9* 5° 170.9 + 1.8 171.3 2 1.7 

my (GeV) 14510 9915 7657 

log [my (GeV)] 2.16 + +0.39 2.00 + 0.19 1.88 + 0.16 

ots (mz) 0.1190 + 0.0028 0.1189 + 0.0027 0.1185 + 0.0026 
mw (MeV) 80385 + 19 80360 + 20 80374 + 15 


All fits use the Z pole results and Aa), (m3) as listed in Table 3.1. In addition, the measurements 
listed on top of each column are included as well. The fitted W mass is also shown [8] (the directly 
measured value is my = 80398 + 25 MeV) 


the input from the direct search result). This is quite encouraging for the ongoing 
search for the Higgs particle. More general, if the Higgs couplings are removed 
from the Lagrangian the resulting theory is non renormalizable. A cutoff A must 
be introduced. In the quantum corrections log mg is then replaced by log A plus 
a constant. The precise determination of the associated finite terms would be lost 
(that is, the value of the mass in the denominator in the argument of the logarithm). 
A heavy Higgs would need some unfortunate accident: the finite terms, different in 
the new theory from those of the SM, should by chance compensate for the heavy 
Higgs in a few key parameters of the radiative corrections (mainly e, and es, see, 
for example, [49]). Alternatively, additional new physics, for example in the form 
of effective contact terms added to the minimal SM lagrangian, should accidentally 
do the compensation, which again needs some sort of conspiracy. 

To the list of precision tests of the SM one should add the results on low 
energy tests obtained from neutrino and antineutrino deep inelastic scattering 
(NuTeV [52]), parity violation in Cs atoms (APV [53]) and the recent measurement 
of the parity-violating asymmetry in Moller scattering [54] (see Chap. 6). When 
these experimental results are compared with the SM predictions the agreement 
is good except for the NuTeV result that shows a deviation by three standard 
deviations. The NuTeV measurement is quoted as a measurement of sin? Oy = 
1] — mi. / m3 from the ratio of neutral to charged current deep inelastic cross- 
sections from v, and v, using the Fermilab beams. But it has been argued and it 
is now generally accepted that the NuTeV anomaly probably simply arises from an 
underestimation of the theoretical uncertainty in the QCD analysis needed to extract 
sin? Øw. In fact, the lowest order QCD parton formalism on which the analysis has 
been based is too crude to match the experimental accuracy. 

When confronted with these results, on the whole the SM performs rather well, 
so that it is fair to say that no clear indication for new physics emerges from the 
data. However, as already mentioned, one problem is that the two most precise 
measurements of sin? Oeff from Az pr and AL, g differ by about 30s. In general, there 
appears to be a discrepancy between sin? Qs; measured from leptonic asymmetries 
((sin? 0err)1) and from hadronic asymmetries ((sin? Oeff)n). In fact, the result from 
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Fig. 3.11 The data for 
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Arg is in good agreement with the leptonic asymmetries measured at LEP, while 
all hadronic asymmetries, though their errors are large, are better compatible with 
the result of Ab, g- These two results for sin? 04g are shown in Fig. 3.11 [55]. Each of 
them is plotted at the m y value that would correspond to it given the central value 
of m;. Of course, the value for my indicated by each sin? Oere has an horizontal 
ambiguity determined by the measurement error and the width of the +1o band for 
m,. Even taking this spread into account it is clear that the implications on my are 
sizably different. One might imagine that some new physics effect could be hidden 
in the Zbb vertex. Like for the top quark mass there could be other non decoupling 
effects from new heavy states or a mixing of the b quark with some other heavy 
quark. However, it is well known that this discrepancy is not easily explained in 
terms of some new physics effect in the Zbb vertex. A rather large change with 
respect to the SM of the b-quark right handed coupling to the Z is needed in order to 
reproduce the measured discrepancy (precisely a ~30% change in the right-handed 
coupling), an effect too large to be a loop effect but which could be produced at the 
tree level, e.g., by mixing of the b quark with a new heavy vectorlike quark [56]), 
or some mixing of the Z with ad hoc heavy states [57]. But then this effect should 
normally also appear in the direct measurement of A; performed at SLD using the 
left-right polarized b asymmetry, even within the moderate precision of this result. 
The measurements of neither Ap at SLD nor Rp confirm the need of a new effect. 
Alternatively, the observed discrepancy could be simply due to a large statistical 
fluctuation or an unknown experimental problem. As a consequence of this problem, 
the ambiguity in the measured value of sin? Oe is in practice larger than the nominal 
error, reported in Eq. 3.108, obtained from averaging all the existing determinations, 
and the interpretation of precision tests is less sharp than it would otherwise be. 

We have already observed that the experimental value of my (with good 
agreement between LEP and the Tevatron) is a bit high compared to the SM 
prediction (see Fig. 3.12). The value of mg indicated by my is on the low side, 


ä : P ‘ lept i : 
just in the same interval as for sin? Nd measured from leptonic asymmetries. 
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In conclusion, overall the validity of the SM has been confirmed to a level that we 
can say was unexpected at the beginning. In the present data there is no significant 
evidence for departures from the SM, no compelling evidence of new physics. The 
impressive success of the SM poses strong limitations on the possible forms of new 
physics. 


3.13 Phenomenology of the SM Higgs 


The Higgs problem is really central in particle physics today. On the one hand, 
the experimental verification of the Standard Model (SM) cannot be considered 
complete until the structure of the Higgs sector is not established by experiment. 
On the other hand, the Higgs is also related to most of the major problems of 
particle physics, like the flavour problem and the hierarchy problem, the latter 
strongly suggesting the need for new physics near the weak scale. In turn the 
discovery of new physics could clarify the dark matter identity. It is clear that the 
fact that some sort of Higgs mechanism is at work has already been established. 
The W or the Z with longitudinal polarization that we observe are not present in an 
unbroken gauge theory (massless spin-1 particles, like the photon, are transversely 
polarized). The longitudinal degree of freedom for the W or the Z is borrowed from 
the Higgs sector and is an evidence for it. Also, it has been verified that the gauge 
symmetry is unbroken in the vertices of the theory: all currents and charges are 
indeed symmetric. Yet there is obvious evidence that the symmetry is instead badly 
broken in the masses. Not only the W and the Z have large masses, but the large 
splitting of, for example, the t-b doublet shows that even a global weak SU(2) is 
not at all respected by the fermion spectrum. This is a clear signal of spontaneous 
symmetry breaking and the implementation of spontaneous symmetry breaking in a 
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gauge theory is via the Higgs mechanism. The big remaining questions are about 
the nature and the properties of the Higgs particle(s). The present experimental 
information on the Higgs sector, is surprisingly limited and can be summarized in 
a few lines, as follows. First, the relation My = MZ cos? Ow, Eq. (3.55), modified 
by small, computable radiative corrections, has been experimentally proven. This 
relation means that the effective Higgs (be it fundamental or composite) is indeed 
a weak isospin doublet. The Higgs particle has not been found but, in the SM, its 
mass can well be larger than the present direct lower limit my Z 114 GeV (at 
95% c.l.) obtained from searches at LEP-2. The radiative corrections computed 
in the SM when compared to the data on precision electroweak tests lead to a 
clear indication for a light Higgs, not too far from the present lower bound. The 
exact experimental upper limit for mg in the SM depends on the value of the top 
quark mass m;. The CDF and DO combined value after Run II is at present [8] 
m, = 170.9 + 1.8 GeV (it went down with respect to the value m; = 178 + 
4.3 GeV from Run I and also the experimental error is now sizably reduced). As 
a consequence the present limit on my is more stringent [8]: my < 182 GeV (at 
95% c.l., after including the information from the 114 GeV direct bound). On the 
Higgs the LHC will address the following questions : one doublet, more doublets, 
additional singlets? SM Higgs or SUSY Higgses? Fundamental or composite (of 
fermions, of WW...)? Pseudo-Goldstone boson of an enlarged symmetry? A 
manifestation of large extra dimensions (5th component of a gauge boson, an effect 
of orbifolding or of boundary conditions...)? Or some combination of the above 
or something so far unthought of? Here in the following we will summarize the 
main properties of the SM Higgs that provide an essential basis for the planning 
and the interpretation of the LHC Higgs programme. We start from the mass, 
then the width and the branching ratios and, finally, the most important production 
channels. 


3.13.1 Theoretical Bounds on the SM Higgs Mass 


It is well known [58-60] that in the SM with only one Higgs doublet a lower limit 
on my can be derived from the requirement of vacuum stability (or, in milder 
form, of a moderate instability, compatible with the lifetime of the Universe [61]). 
The limit is a function of m, and of the energy scale A where the model breaks 
down and new physics appears. The Higgs mass enters because it fixes the initial 
value of the quartic Higgs coupling A for its running up to the large scale A. 
Similarly an upper bound on mg (with mild dependence on m+) is obtained, as 
described in [62] and refs. therein, from the requirement that for à no Landau pole 
appears up to the scale A, or in simpler terms, that the perturbative description 
of the theory remains valid up to A. We now briefly recall the derivation of these 
limits. 

The possible instability of the Higgs potential V [4$] is generated by the quantum 
loop corrections to the classical expression of V[ó]. At large $ the derivative 
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V'[ó] could become negative and the potential would become unbound from below. 
The one-loop corrections to V[$] in the SM are well known and change the 
dominant term at large @ according to A94 — (A + y log $?/ A?)ó*. This one- 
loop approximation is not enough in this case, because it fails at large enough $, 
when y log $?/ A? becomes of order one. The renormalization group improved 
version of the corrected potential leads to the replacement AQ — MA) (A) 
where A(A) is the running coupling and $'(j4) = $ exp P y (t)dt', with y (t) being 
an anomalous dimension function and t = logA/v (v is the vacuum expectation 
value v = (2//2G p) !/?). As a result, the positivity condition for the potential 
amounts to the requirement that the running coupling A(A) never becomes negative. 
A more precise calculation, which also takes into account the quadratic term in the 
potential, confirms that the requirements of positive A(A) leads to the correct bound 
down to scales A as low as ~1 TeV. The running of A(A) at one loop is given 
by: 


d 


di = pal” + 34h? — 9h? + small gauge and Yukawa terms] , (3.109) 
with the normalization such that at t = 0, à = Ao = m^, / 2v? and the top Yukawa 
coupling h? = m,/v. We see that, for m y small and m, fixed at its measured value, 
à decreases with t and can become negative. If one requires that à remains positive 
up to A = 10!5-10!° GeV, then the resulting bound on m y in the SM with only one 
Higgs doublet is given by, (also including the effect of the two-loop beta function 
terms) [60] : 


— 0.11 
mg (GeV) > 128.4 + 2.1 [m, — 170.9] — 4.5 2 (3.110) 


Note that this limit is evaded in models with more Higgs doublets. In this case the 
limit applies to some average mass but the lightest Higgs particle can well be below, 
as it is the case in the minimal SUSY extension of the SM (MSSM). 

The upper limit on the Higgs mass in the SM is clearly important for assessing 
the chances of success of the LHC as an accelerator designed to solve the Higgs 
problem. The upper limit [62] arises from the requirement that the Landau pole 
associated with the non asymptotically free behaviour of the A¢* theory does not 
occur below the scale A. The initial value of A at the weak scale increases with 
my and the derivative is positive at large À (because of the positive 47 term—the 
A. theory is not asymptotically free—which overwhelms the negative top- Yukawa 
term). Thus, if my is too large, the point where à computed from the perturbative 
beta function becomes infinite (the Landau pole) occurs at too low an energy. Of 
course in the vicinity of the Landau pole the 2-loop evaluation of the beta function 
is not reliable. Indeed the limit indicates the frontier of the domain where the theory 
is well described by the perturbative expansion. Thus the quantitative evaluation 
of the limit is only indicative, although it has been to some extent supported by 
simulations of the Higgs sector of the EW theory on the lattice. For the upper limit 
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on my one finds [62] 


mg S 180GeV for A ~ Maur — Mp; 
mg S; 0.5 — 0.8TeV for A ~ 1 TeV. (3.111) 


In conclusion, for m; ~ 171 GeV, only a small range of values for mg is allowed, 
130 < mg < ~ 200GeV, if the SM holds up to A ^ Mgvur or Mp). 

An additional argument indicating that the solution of the Higgs problem cannot 
be too far away is the fact that, in the absence of a Higgs particle or of an alternative 
mechanism, violations of unitarity appear in some scattering amplitudes at energies 
in the few TeV range [63]. In particular, amplitudes involving longitudinal gauge 
bosons (those most directly related to the Higgs sector) are affected. For example, 


at tree level in the absence of Higgs exchange, for s >> m?, one obtains: 


E .8 
A(WiW, — ZLZL)no Higgs ~ is (3.112) 


In the SM this unacceptable large energy behaviour is quenched by the Higgs 
exchange diagram contribution: 


2 

Kj 
A(QV*W- — Zr Zr)Hiess ~ —i —— 3.113 
( LL L L)Higgs ws — m2) ( ) 

Thus the total result in the SM is: 

2 

+ E SMH 
A(W; W; > ZLZL)SM ^v — (3.114) 


jee 
v2(s — m7) 


which at large energies saturates at a constant value. To be compatible with unitarity 
bounds one needs m5, < An 42/Gp ormy < 1.5 TeV. Both the Landau pole and 
the unitarity argument show that, if the Higgs is too heavy, the SM becomes a non 
perturbative theory at energies of o(1 TeV). In conclusion, these arguments imply 
that the SM Higgs cannot escape detection at the LHC. 


3.13.2 SM Higgs Decays 


The total width and the branching ratios for the SM Higgs as function of my are 
given in Figs. 3.13 and 3.14, respectively [64]. 

Since the couplings of the Higgs particle are in proportion to masses, when mg 
increases the Higgs becomes strongly coupled. This is reflected in the sharp rise of 
the total width with m y. For m y near its present lower bound of 114 GeV, the width 
is below 5 MeV, much less than for the W or the Z which have a comparable mass. 
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Fig. 3.13 The total width of 1000 
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100 
10 
0. 
0.0 
0.00 


| | | | | | | 
100 130 160 200 300 500 700 1000 


T(H) [GeV] 


Mass mj [GeV] 
Fig. 3.14 The branching 
ratios ofthe SM Higgs boson: — .— | Ew — 7» aaa 
0. WW 
[65] `~------] 
gaan CUR MN 
= tt 
| | 
300 500 


Mass my [GeV] 


The dominant channel for such a Higgs is H —> bb. In Born approximation the 
partial width into a fermion pair is given by Djouadi [64] and Haber [66]: 


GF 
TG > ff) -5N amis (3.115) 


where By = (1 — 4m? "7 m ius. 2. The factor of 6? appears because the fermion pair 
must be in a p-state ak orbital angular momentum for a Higgs with scalar coupling, 
because of parity (this factor would be f for a pseudoscalar coupling). We see that 
the width is suppressed by a factor m / my with respect to the natural size G rm 
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for the width of a particle of mass my decaying through a diagram with only one 
weak vertex. 

A glance to the branching ratios shows that the branching ratio into r pairs is 
larger by more than a factor of two with respect to the cc channel. This is at first 
sight surprising because the colour factor Nc favours the quark channels and the 
masses of t’s and of D mesons are quite similar. This is due to the fact that the 
QCD corrections replace the charm mass at the scale of charm with the charm 
mass at the scale my, which is lower by about a factor of 2.5. The masses run 
logarithmically in QCD, similar to the coupling constant. The corresponding logs 
are already present in the 1-loop QCD correction that amounts to the replacement 
m? — m2[1 + 2a /7 (log m2 / mj, + 3/2)] ~ mz (m4). 

The Higgs width sharply increases as the WW threshold is approached. For decay 
into a real pair of V's, with V — W, Z, one obtains in Born approximation [64, 66]: 


T(H > VV) Grm g Bw(1 — 4x + 12x?) (3.116) 
— = — 4x X . 
ihe m Y 


where By = 4/1— 4x with x = m/m? and ôw = 2, 5z = 1. Much above 
threshold the VV channels are dominant and the total width, given approximately 
by: 


1 s y (3.117) 
becomes very large, signalling that the Higgs sector is becoming strongly interacting 
(recall the upper limit on the SM Higgs mass in Eq. (3.111)). The VV dominates 
over the tf because of the £ threshold factors that disfavour the fermion channel 
and, at large my, by the cubic versus linear behaviour with m y of the partial widths 
for VV versus tf. Below the VV threshold the decays into virtual V particles is 
important: V V* and V*V*. Note in particular the dip of the ZZ branching ratio 
just below the ZZ threshold: this is due to the fact that the W is lighter than the Z 
and the opening of its threshold depletes all other branching ratios. When the ZZ 
threshold is also passed then the ZZ branching fraction comes back to the ratio of 
approximately 1:2 with the WW channel (just the number of degrees of freedom: 
two hermitian fields for the W, one for the Z). 

The decay channels into y y, Zy and gg proceed through loop diagrams, with 
the contributions from W (only for y y and Zy ) and from fermion loops (for all) 
(Fig. 3.15). 

We reproduce here the results for (H — y y) and T(H — gg) [64, 66]: 


yag EL HE OF Aj (cy)? (3.118) 
= T T i 
yY 1287 TERA Aw(tw C ff 
G 
Gites Grom, 3 Asap (3.119) 
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Fig. 3.15 One-loop diagrams Ns 
for Higgs decay into yy, Zy H H 
and gg 


where rj; = my, /Am? and: 


| 9 


Afa) = Slt t (c — DFO) 


N 


T 
Aw(t) = Ar + 3r + 302r - Df) (3.120) 
with: 


f(t) = arcsin? /t for t< 1 


10 1+/1 r] 
f(t) = -30 H —in] forrt>l (3.121) 
— -T 


For H — yy (as well as for H — Zy) the W loop is the dominant contribution at 
small and moderate mj. We recall that the yy mode can be a possible channel for 
Higgs discovery only for m g near its lower bound (i.e for 114 < my < 150 GeV). 
In this domain of mg we have l'(H. — yy)76-23 KeV. For example, in the 
limit my << 4m?, or t — 0, we have Ayw(0) = —7 and Ay(0) = 4/3. The 
two contributions become comparable only for my ~ 650 GeV where the two 
amplitudes, still of opposite sign, nearly cancel. The top loop is dominant among 
fermions (lighter fermions are suppressed by m^. / m, modulo logs) and, as we have 
seen, it approaches a constant for large m;. Thus the fermion loop amplitude for 
the Higgs would be sensitive to effects from very heavy fermions, in particular the 
H — gg effective vertex would be sensitive to all possible very heavy coloured 
quarks. As discussed in the QCD Chapter (Chap.4) the gg — H vertex provides 
one of the main production channels for the Higgs at hadron colliders. 


3.14 Limitations of the Standard Model 


No signal of new physics has been found neither in electroweak precision tests nor 
in flavour physics. Given the success of the SM why are we not satisfied with this 
theory? Why not just find the Higgs particle, for completeness, and declare that 
particle physics is closed? The reason is that there are both conceptual problems 
and phenomenological indications for physics beyond the SM. On the conceptual 
side the most obvious problems are that quantum gravity is not included in the SM 
and the related hierarchy problem. Among the main phenomenological hints for new 
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physics we can list coupling unification, dark matter, neutrino masses (discussed in 
Sect. (3.7)), baryogenesis and the cosmological vacuum energy. 

The computed evolution with energy of the effective SM gauge couplings clearly 
points towards the unification of the electro-weak and strong forces (GUT’s) at 
scales of energy Maur ~ 10? — 10/6 GeV [31] which are close to the scale 
of quantum gravity, Mp; ~ 10!° GeV. One is led to imagine a unified theory 
of all interactions also including gravity (at present superstrings provide the best 
attempt at such a theory). Thus GUT's and the realm of quantum gravity set a 
very distant energy horizon that modern particle theory cannot ignore. Can the SM 
without new physics be valid up to such large energies? One can imagine that some 
obvious problems could be postponed to the more fundamental theory at the Planck 
mass. For example, the explanation of the three generations of fermions and the 
understanding of fermion masses and mixing angles can be postponed. But other 
problems must find their solution in the low energy theory. In particular, the structure 
of the SM could not naturally explain the relative smallness of the weak scale of 
mass, set by the Higgs mechanism at u ~ 1/4/G x ~ 250 GeV with Gp being the 
Fermi coupling constant. This so-called hierarchy problem is due to the instability 
of the SM with respect to quantum corrections. This is related to the presence of 
fundamental scalar fields in the theory with quadratic mass divergences and no 
protective extra symmetry at u = 0. For fermion masses, first, the divergences are 
logarithmic and, second, they are forbidden by the SU (2) & U (1) gauge symmetry 
plus the fact that at m = 0 an additional symmetry, i.e. chiral symmetry, is restored. 
Here, when talking of divergences, we are not worried of actual infinities. The 
theory is renormalizable and finite once the dependence on the cut off A is absorbed 
in a redefinition of masses and couplings. Rather the hierarchy problem is one of 
naturalness. We can look at the cut off as a parameterization of our ignorance on the 
new physics that will modify the theory at large energy scales. Then it is relevant to 
look at the dependence of physical quantities on the cut off and to demand that no 
unexplained enormously accurate cancellations arise. 

The hierarchy problem can be put in very practical terms (the “little hierarchy 
problem"): loop corrections to the Higgs mass squared are quadratic in A. The most 
pressing problem is from the top loop. With m; = m; arre ôm? the top loop gives 


3GF 
mii) Ps api ~ —(0.2A)? (3.122) 


If we demand that the correction does not exceed the light Higgs mass indicated 
by the precision tests, A must be close, A ^ o(1 TeV). Similar constraints arise 
from the quadratic A dependence of loops with gauge bosons and scalars, which, 
however, lead to less pressing bounds. So the hierarchy problem demands new 
physics to be very close (in particular the mechanism that quenches the top loop). 
Actually, this new physics must be rather special, because it must be very close, yet 
its effects are not clearly visible neither in precision electroweak tests (the “LEP 
Paradox” [67]) nor in flavour changing processes and CP violation. Examples of 
proposed classes of solutions for the hierarchy problem are: (1) Supersymmetry 
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[68]. In the limit of exact boson-fermion symmetry the quadratic divergences 
of bosons cancel so that only log divergences remain. However, exact SUSY is 
clearly unrealistic. For approximate SUSY (with soft breaking terms), which is the 
basis for all practical models, A is replaced by the splitting of SUSY multiplets, 
A? ~ ne Sy — m2. q- In particular, the top loop is quenched by partial cancellation 
with s-top exchange, so the s-top cannot be too heavy. (2) Technicolor [69]. The 
Higgs system is a condensate of new fermions. There is no fundamental scalar 
Higgs sector, hence no quadratic divergences associated to the u? mass in the scalar 
potential. This mechanism needs a very strong binding force, Arc ~ 10? AQCD. 
It is difficult to arrange that such nearby strong force is not showing up in precision 
tests. Hence this class of models has been disfavoured by LEP, although some 
special class of models have been devised aposteriori, like walking TC, top-color 
assisted TC etc (for recent reviews, see, for example, [69]). (3) Extra dimensions (for 
a recent review, see, for example, [70]). The idea is that M p; appears very large, or 
equivalently that gravity appears very weak, because we are fooled by hidden extra 
dimensions so that either the real gravity scale is reduced down to a lower scale, 
even possibly down to o(1 T'eV) or the intensity of gravity is red shifted away by 
an exponential warping factor [71]. This possibility is very exciting in itself and it 
is really remarkable that it is compatible with experiment. It provides a very rich 
framework with many different scenarios. (4) "Little Higgs" models [72]. In these 
models the Higgs is a pseudo-Goldstone boson and extra symmetries allow mp 4 0 
only at two-loop level, so that A can be as large as o(10 TeV) with the Higgs within 
present bounds (the top loop is quenched by exchange of heavy vectorlike new 
quarks with charge 2/3). The physics beyond the SM will be discussed in Chap. 8. 
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Chapter 4 A) 
QCD: The Theory of Strong Interactions yss 


Guido Altarelli and Stefano Forte 


4.1 Introduction 


This Chapter! is devoted to a concise introduction to Quantum Chromo-Dynamics 
(QCD), the theory of strong interactions [1-3]. We start with a general introduction 
where a broad overview of the strong interactions is presented. The basic principles 
and the main applications of perturbative QCD will be discussed first (for reviews 
of the subject, see, for example, [4—6]). Then the methods of non perturbative QCD 
will be introduced, first the analytic approaches and then the simulations of the 
theory on a discrete space-time lattice. The main emphasis will be on ideas with a 
minimum of technicalities. 

As discussed in Chap.2 the QCD theory of strong interactions is an unbroken 
gauge theory based on the group SU (3) of colour. The eight massless gauge bosons 
are the gluons gi and matter fields are colour triplets of quarks q7 (in different 
flavours i). Quarks and gluons are the only fundamental fields of the Standard Model 
(SM) with strong interactions (hadrons). As discussed in Chap. 2, the statement that 
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QCD is a renormalisable gauge theory based on the group SU (3) with colour triplet 
quark matter fields [7] fixes the QCD lagrangian density to be: 


8 nf 
1 ' Lo 
Ce = J aad ih + ) qj —mj)qj (4.1) 


A=1 j=l 


Here: qj are the quark fields (of n different flavours) with mass mj; D = Duy", 
where y are the Dirac matrices and D,, is the covariant derivative: 


Dy = ðu + iesgy; (4.2) 


és is the gauge coupling, later we will mostly use, in analogy with QED 
a; = —-; (4.3) 


g=} t^g where B A = 1,8, are the gluon fields and t4 are the SU (3) 
group generators in the triplet representation of quarks (i.e. t4 are 3 x 3 matrices 
acting on q); the generators obey the commutation relations [t^, t?] = iCagct© 
where CAgc are the complete antisymmetric structure constants of SU (3) (the 
normalisation of CA4gc and of e; is specified by Tr[t^t?] = 8^8 /2); 


FA, = dugi — dogi — esCAncabac (4.4) 


For quantisation the classical Lagrangian in Eq. (4.1) must be enlarged to contain 
gauge fixing and ghost terms, as described in Chap.2. The Feynman rules of 
QCD are listed in Fig. 4.1. The physical vertices in QCD include the gluon-quark- 
antiquark vertex, analogous to the QED photon-fermion-antifermion coupling, but 
also the 3-gluon and 4-gluon vertices, of order e, and e respectively, which have 
no analogue in an abelian theory like QED. 

The QCD lagrangian in Eq. (4.1) has a simple structure but a very rich dynamical 
content. It gives rise to a complex spectrum of hadrons, it implies the striking 
properties of confinement and asymptotic freedom, is endowed with an approximate 
chiral symmetry which is spontaneously broken, has a highly non trivial topological 
vacuum structure (instantons, U(1)4 symmetry breaking, strong CP violation 
(which is a problematic item in QCD possibly connected with new physics, like 
axions), ...), an intriguing phase transition diagram (colour deconfinement, quark- 
gluon plasma, chiral symmetry restoration, colour superconductivity, . . . ). 

Confinement is the property that no isolated coloured charge can exist but only 
colour singlet particles. For example, the potential between a quark and an antiquark 
has been studied on the lattice. It has a Coulomb part at short distances and a linearly 
rising term at long distances: 


as (r) 


Vag ~ Cri +....+or] (4.5) 


4 QCD: The Theory of Strong Interactions 85 


« SAB MV 
É l | g^* (1-4) qa | covariant 
Au By q tie q tie 
Gluon VVV AB 
q id | pz Mu E UC ay 

q^ ie (nq) (ng)? 

Quark ———— - i 
P p—m-ie 

;SAB 
Ghost m. id covariant 

p^ ie 

NA 

A T M voe E m 

let y en —eC^q covariant 

q AC, 
Bv 
q 

AL , 

eC" [g""(p— q^ + gq- + g^" (r—p)"] 

P0, 
GA 
AA Bu 
= ie; Fco g^ g^ — g^ gt”) 4 ceci g^ g^ — g^ gt 4 cce gg? = gi gP”) 

Cv Dp 


Fig. 4.1 Feynman rules for QCD. The solid lines represent the fermions, the curly lines the gluons, 
and the dotted lines represent the ghosts (see Chap. 2). The gauge parameter is denoted by A. The 
3-gluon vertex is written as if all gluon lines are outgoing 


where 


1 "v Ne-1 
Cr = — X) PP = C 4.6 
F KÈ (4.6) 


with Nc the number of colours (Nc = 3 in QCD). The scale dependence of 
as (the distance r is Fourier-conjugate to momentum transfer) will be explained 
in detail in the following. The understanding of the confinement mechanism has 
much improved thanks to lattice simulations of QCD at finite temperatures and 
densities. The slope decreases with increasing temperature until it vanishes at a 
critical temperature Tc. Above Tc the slope remains zero. The phase transitions 
of colour deconfinement and of chiral restauration appear to happen together on the 
lattice. A rapid transition is observed in lattice simulations where the energy density 
€ (T) is seen to sharply increase near the critical temperature for deconfinement and 
chiral restauration. The critical parameters and the nature of the phase transition 
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depend on the number of quark flavours n; and on their masses. For example, 
for nf = 2 or 2 + 1 (ie. two light u and d quarks and one heavier s quark), 
Tc ~ 175 MeV and e(Tc) ~ 0.5 — 1.0 GeV/fm?. For realistic values of the masses 
ms and m,a the phase transition appears to be a second order one, while it becomes 
first order for very small or very large m,_q,;. The hadronic phase and the deconfined 
phase are separated by a crossover line at small densities and by a critical line at 
high densities. Determining the exact location of the critical point in T and upg is 
an important challenge for theory which is also important for the interpretation of 
heavy ion collision experiments. At high densities the colour superconducting phase 
is also present with bosonic diquarks acting as Cooper pairs. 

A large investment is being done in experiments of heavy ion collisions with 
the aim of finding some evidence of the quark gluon plasma phase. Many exciting 
results have been found at the CERN SPS in the past years and more recently at 
RHIC. The status of the experimental search for the quark-gluon plasma will be 
reviewed in Chap. 7. 

The linearly rising term in the potential makes it energetically impossible to 
separate a q — q pair. If the pair is created at one space-time point, for example 
in e*e- annihilation, and then the quark and the antiquark start moving away from 
each other in the center of mass frame, it soon becomes energetically favourable 
to create additional pairs, smoothly distributed in rapidity between the two leading 
charges, which neutralise colour and allow the final state to be reorganised into two 
jets of colourless hadrons, that communicate in the central region by a number of 
"wee" hadrons with small energy. It is just like the familiar example of the broken 
magnet: if you try to isolate a magnetic pole by stretching a dipole, the magnet 
breaks down and two new poles appear at the breaking point. 

Confinement is essential to explain why nuclear forces have very short range 
while massless gluon exchange would be long range. Nucleons are colour singlets 
and they cannot exchange colour octet gluons but only colourless states. The lightest 
colour singlet hadronic particles are pions. So the range of nuclear forces is fixed by 
the pion mass r ~ m7! x 107! em: V © exp(—mgr)/r. 

Why SU(Nc = 3)colour? The selection of SU (3) as colour gauge group is 
unique in view of a number of constraints. (a) The group must admit complex 
representations because it must be able to distinguish a quark from an antiquark. 
In fact there are meson states made up of qq but not analogous qq bound states. 
Among simple groups this restricts the choice to SU(N) with N > 3, SO (4N + 2) 
with N > 2 (taking into account that SO (6) has the same algebra as SU (4)) and 
E (6). (b) The group must admit a completely antisymmetric colour singlet baryon 
made up of 3 quarks: qqq. In fact, from the study of hadron spectroscopy we 
know that the low lying baryons, completing an octet and a decuplet of (flavour) 
SU (3) (the approximate symmetry that rotate the three light quarks u, d and s), are 
made up of three quarks and are colour singlets. The qqq wave function must be 
completely antisymmetric in colour in order to agree with Fermi statistics. Indeed 
if we consider, for example, a N**^ with spin z-component +3/2, this is made 
up of (u fy u fj u T) in an s-state. Thus its wave function is totally symmetric in 
space, spin and flavour so that complete antisymmetry in colour is required by Fermi 
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statistics. In QCD this requirement is very simply satisfied by e;5:q^q^q^ where a, 
b, c are SU(3) colour indices. (c) The choice of SU (Nc = 3)colour is confirmed by 
many processes that directly measure Nc. Some examples are listed here. The total 
rate for hadronic production in ete~ annihilation is linear in Nc. Precisely if we 
consider R = o(ete~ > hadrons) /opoint (et €^ — pt) above bb threshold 
and below mz and we neglect small computable radiative corrections (that will be 
discussed later in Sect. 4.5) we have a sum of individual contributions (proportional 
to Q?, where Q is the electric charge in units of the proton charge) from q4 final 
states with q =u, c, d, s, b: 


R  NcD- 2 + 3-2) © New (4.7) 
ae 9 9 ' 
The data neatly indicate Nc = 3 as seen from Fig. 4.2 [9]. The slight excess of 
the data with respect to the value 11/3 is due to the QCD radiative corrections 
(Sect. 4.5). Similarly we can consider the branching ratio BQW- — e V), again 
in Born P dpi: The possible fermion-antifermion ( f f) final states are for 
f =e, W, , d, s (there is no f = b because the top quark is too heavy for bt 
to occur). Bach chanel gives the same contribution, except that for quarks we have 
Nc colours: 


B(W- > ev) ~x ———— (4.8) 
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Fig. 4.2. Comparison of the data on R — olete > hadrons)/6poini (et €^ — put) with 
the QCD prediction [9]. Nc = 3 is indicated 
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For Nc = 3 we obtain B = 11% and the experimental number is B = 10.7%. 
Another analogous example is the branching ratio B(t~ — e vov). From the final 
state channels with f =e , uw, d we find 


1 


Bit = € Ver) x 24 Nc 


(4.9) 


For Nc = 3 we obtain B = 20% and the experimental number is B = 18% (the less 
accuracy in this case is explained by the larger radiative and phase-space corrections 
because the mass of t^ is much smaller than mw). An important process that is 
quadratic in Nc is the rate ror? > 2y). This rate can be reliably calculated from a 
solid theorem in field theory which has to do with the chiral anomaly: 


ral + 2y) = ( 


yO 7.7 + 0.04) (——)* eV 4.1 
3 ) 3253 f2 (7.73 £ DOC "e (4.10) 


where the prediction is obtained for fr = (130.7 + 0.37) MeV. The experimental 
resultis I = (7.7-£0.5) eV in remarkable agreement with Nc = 3. There are many 
more experimental confirmations that Nc = 3: for example the rate for Drell-Yan 
processes (see Sect. 5.4) is inversely proportional to Nc. 

How do we get testable predictions from QCD? On the one hand there are non 
perturbative methods. The most important at present is the technique of lattice 
simulations: it is based on first principles, it has produced very valuable results on 
confinement, phase transitions, bound states, hadronic matrix elements and so on, 
and it is by now an established basic tool. The main limitation is from computing 
power and therefore there is continuous progress and a lot of good perspectives 
for the future. Another class of approaches is based on effective lagrangians which 
provide simpler approximations than the full theory, valid in some definite domain 
of physical conditions. Chiral lagrangians are based on soft pion theorems and are 
valid for suitable processes at energies below 1 GeV. Heavy quark effective theories 
are obtained from expanding in inverse powers of the heavy quark mass and are 
mainly important for the study of b and, to less accuracy, c decays. The approach of 
QCD sum rules has led to interesting results but appears to offer not much potential 
for further development. Similarly specific potential models for quarkonium have a 
limited range of application. On the other hand, the perturbative approach, based on 
asymptotic freedom, still remains the main quantitative connection to experiment, 
due to its wide range of applicability to all sorts of “hard” processes. To perturbative 
QCD will be devoted the next sections. 


4.2 Massless QCD and Scale Invariance 


As discussed in Chap. 2, the QCD lagrangian in Eq. (4.1) only specifies the theory 
at the classical level. The procedure for quantisation of gauge theories involves a 
number of complications that arise from the fact that not all degrees of freedom of 
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gauge fields are physical because of the constraints from gauge invariance which 
can be used to eliminate the dependent variables. This is already true for abelian 
theories and we are familiar with the QED case. One introduces a gauge fixing term 
(an additional term in the lagrangian density that acts as a Lagrange multiplier in 
the action extremisation). One can choose to preserve manifest Lorentz invariance. 
In this case, one adopts a covariant gauge, like the Lorentz gauge, and in QED 
one proceeds according to the formalism of Gupta-Bleuler. Or one can give up 
explicit formal covariance and work in a non covariant gauge, like the Coulomb 
or the axial gauges, and only quantise the physical degrees of freedom (in QED the 
transverse components of the photon field). While this is all for an abelian gauge 
theory, in the non-abelian case some additional complications arise, in particular 
the necessity to introduce ghosts for the formulation of Feynman rules. As we 
have seen, there are in general as many ghost fields as gauge bosons and they 
appear in the form of a transformation Jacobian in the Feynman diagram functional 
integral. Ghosts only propagate in closed loops and their vertices with gluons can be 
included as additional terms in the lagrangian density which are fixed once the gauge 
fixing terms and their infinitesimal gauge transformations are specified. Finally 
the complete Feynman rules in a given gauge can be obtained and they appear in 
Fig. 4.1. 

Once the Feynman rules are derived we have a formal perturbative expansion 
but loop diagrams generate infinities. First a regularisation must be introduced, 
compatible with gauge symmetry and Lorentz invariance. This is possible in QCD. 
In principle one can introduce a cut-off K (with dimensions of energy), for example, 
a’ la Pauli- Villars. But at present the universally adopted regularisation procedure is 
dimensional regularisation that we will briefly describe later on. After regularisation 
the next step is renormalisation. In a renormalisable theory (like for all gauge 
theories in four spacetime dimensions and for QCD in particular) the dependence 
on the cutoff can be completely reabsorbed in a redefinition of particle masses, 
of gauge coupling(s) and of wave function normalisations. After renormalisation 
is achieved the perturbative definition of the quantum theory that corresponds to 
a classical lagrangian like in Eq. (4.1) is completed. In the QCD Lagrangian of 
Eq. (4.1) quark masses are the only parameters with physical dimensions (we work 
in the natural system of units  — c = 1). Naively we would expect that massless 
QCD is scale invariant. This is actually true at the classical level. Scale invariance 
implies that dimensionless observables should not depend on the absolute scale of 
energy but only on ratios of energy-dimensional variables. The massless limit should 
be relevant for the asymptotic large energy limit of processes which are non singular 
form — 0. 

The naive expectation that massless QCD should be scale invariant is false in 
the quantum theory. The scale symmetry of the classical theory is unavoidably 
destroyed by the regularisation and renormalisation procedure which introduce a 
dimensional parameter in the quantum version of the theory. When a symmetry 
of the classical theory is necessarily destroyed by quantisation, regularisation and 
renormalisation one talks of an “anomaly”. So, in this sense, scale invariance in 
massless QCD is anomalous. 
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While massless QCD is finally not scale invariant, the departures from scaling 
are asymptotically small, logarithmic and computable. In massive QCD there are 
additional mass corrections suppressed by powers of m/E, where E is the energy 
scale (for non singular processes in the limit m — 0). At the parton level (q and 
£) we can conceive to apply the asymptotics from massless QCD to processes and 
observables (we use the word "processes" for both) with the following properties 
(“hard processes"). (a) All relevant energy variables must be large: 


E; = ziQ. Q >> mj; zi: scaling variables o(1) (4.11) 


(b) There should be no infrared singularities (one talks of “infrared safe” processes). 
(c) The processes concerned must be finite for m — 0 (no mass singularities). 
To possibly satisfy these criteria processes must be as "inclusive" as possible: 
one should include all final states with massless gluon emission and add all mass 
degenerate final states (given that quarks are massless also q — q pairs can be 
massless if “collinear”, that is moving together in the same direction at the common 
speed of light). 

In perturbative QCD one computes inclusive rates for partons (the fields in the 
lagrangian, that is, in QCD, quarks and gluons) and takes them as equal to rates 
for hadrons. Partons and hadrons are considered as two equivalent sets of complete 
states. This is called “global duality” and it is rather safe in the rare instance of a 
totally inclusive final state. It is less so for distributions, like distributions in the 
invariant mass M (“local duality”) where it can be reliable only if smeared over a 
sufficiently wide bin in M. 

Let us discuss more in detail infrared and collinear safety. Consider, for example, 
a quark virtual line that ends up into a real quark plus a real gluon (Fig. 4.3). 

For the propagator we have: 


1 1 1 1 
ropagator = ———————— = ——— = ————.-——————— 
PES (ptky2—m? Xp. 2EKEp 1- fpcosó 


(4.12) 
Since the gluon is massless, E, can vanish and this corresponds to an infrared 
singularity. Remember that we have to take the square of the amplitude and integrate 
over the final state phase space, or, in this case, all together, d Eg/ Eg. Indeed we 
get 1/ E? from the squared amplitude and d?k/ Ep ~ Ed E; from the phase space. 


Fig. 4.3 The splitting of a k 
virtual quark into a quark and 
a gluon 
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Fig. 4.4 The diagrams contributing to the total cross-section e* e^ — hadrons at order œs. For 


simplicity, only the final state quarks and (virtual or real) gluons are drawn 


et 


Fig. 4.5 The total cross-section et e^ — hadrons 


Also, form — 0, Bp = ,/1 — m?/E? — 1 and (1— fj cos 0) vanishes at cos @ = 1. 
This leads to a collinear mass singularity. 

There are two very important theorems on infrared and mass singularities. The 
first one is the Bloch-Nordsieck theorem [8]: infrared singularities cancel between 
real and virtual diagrams (see Fig. 4.4) when all resolution indistinguishable final 
states are added up. For example, for each real detector there is a minimum energy 
of gluon radiation that can be detected. For the cancellation of infrared divergences, 
one should add all possible gluon emission with a total energy below the detectable 
minimum. The second one is the Kinoshita-Lee, Nauenberg theorem [10]: mass 
singularities connected with an external particle of mass m are canceled if all 
degenerate states (that is with the same mass) are summed up. That is for a final 
state particle of mass m we should add all final states that in the limit m — 0 have 
the same mass, also including gluons and massless pairs. If a completely inclusive 
final state is taken, only the mass singularities from the initial state particles remain 
(we shall see that they will be absorbed inside the non perturbative parton densities, 
which are probability densities of finding the given parton in the initial hadron). 

Hard processes to which the massless QCD asymptotics can possibly apply must 
be infrared and collinear safe, that is they must satisfy the requirements from the 
Bloch-Nordsieck and the Kinoshita-Lee-Nauenberg theorems. We give now some 
examples of important hard processes. One of the simplest hard processes is the 
totally inclusive cross section for hadron production in et e~ annihilation, Fig. 4.5, 
parameterised in terms of the already mentioned dimensionless observable R = 
o(e*e- — hadrons) /opoini(ete~ — utu”). The pointlike cross section in the 
denominator is given by Opoint = Azo? /3s, where s — Q? — AE? is the squared 
total center of mass energy and Q is the mass of the exchanged virtual gauge boson. 
At parton level the final state is (qq + n g + n'q'q') and n and m’ are limited at 
each order of perturbation theory. It is assumed that the conversion of partons into 
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Fig. 4.6 Deep inelastic 
lepto-production 


hadrons does not affect the rate (it happens with probability 1). We have already 
mentioned that in order for this to be true within a given accuracy an averaging over 
a sufficiently large bin of Q must be understood. The binning width is larger in the 
vicinity of thresholds: for example when one goes across the charm cc threshold 
the physical cross-section shows resonance bumps which are absent in the smooth 
partonic counterpart which however gives an average of the cross-section. 

A very important class of hard processes is Deep Inelastic Scattering (DIS) 


L+ No +x l= e, u*, v, v (4.13) 


which has played and still plays a very important role for our understanding of QCD 
and of nucleon structure. For the processes in Eq. (4.13), shown in Fig. 4.6, we have, 
in the lab system where the nucleon of mass m is at rest: 


Q? = -q = —(k—Ky = 4EF' sin’ 6/2; mv = (p.q); x= — 


In this case the virtual momentum q of the gauge boson is spacelike. x is the 
familiar Bjorken variable. The DIS processes in QCD will be extensively discussed 
in Sect. 4.5 


4.3 The Renormalisation Group and Asymptotic Freedom 


In this section we aim at providing a reasonably detailed introduction to the 
renormalisation group formalism and the concept of running coupling which leads 
to the result that QCD has the property of asymptotic freedom. We start with a 
summary on how renormalisation works. 

In the simplest conceptual situation imagine that we implement regularisation of 
divergent integrals by introducing a dimensional cut-off K that respects gauge and 
Lorentz invariance. The dependence of renormalised quantities on K is eliminated 
by absorbing it into a redefinition of m (the quark mass: for simplicity we assume 
a single flavour here), the gauge coupling e (can be e in QED or es in QCD) 
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and the wave function renormalisation factors zy’ E for q and g, using suitable 


renormalisation conditions (that is precise definitions of m, g and Z that can be 
implemented order by order in perturbation theory). For example we can define 
the renormalised mass m as the position of the pole in the quark propagator and, 
similarly, the normalisation Z, as the residue at the pole: 


Z 
Propagator = —,— —, + no- pole terms (4.15) 
p^—m 


The renormalised coupling e can be defined in terms of a renormalised 3-point 
vertex at some specified values of the external momenta. Precisely, we consider 
a one particle irreducible vertex (1PI). We recall that a connected Green function 
is the sum of all connected diagrams, while 1PI Green functions are the sum of all 
diagrams that cannot be separated into two disconnected parts by cutting only one 
line. 

We now become more specific by concentrating on the case of massless QCD. If 
we start from a vanishing mass at the classical (or “bare’’) level, mo = 0, the mass 
is not renormalised because it is protected by a symmetry, chiral symmetry. The 
conserved currents of chiral symmetry are axial currents: 4y, ysq. The divergence 
of the axial current gives, by using the Dirac equation, 0“ (qy,ysq) = 2mqysq.So 
the axial current and the corresponding axial charge are conserved in the massless 
limit. Since QCD is a vector theory we have not to worry about chiral anomalies 
in this respect. So one can choose a regularisation that preserves chiral symmetry 
besides gauge and Lorentz symmetry. Then the renormalised mass remains zero. 
The renormalised propagator has the form in Eq. (4.15) with m = 0. 

The renormalised coupling e; can be defined from the renormalised 1PI 3-gluon 
vertex at a scale —yu? (Fig. 4.7): 


-3n 
Vsare(p^, q?, r?) = ZVpen( p’, q^, r2), Z = Zg / , Vren( u^. Hu. u?) > ês 
(4.16) 


We could as well use the quark-gluon vertex or any other vertex which coincides 
with eo in lowest order (even the ghost-gluon vertex, if we want). With a regularisa- 
tion and renormalisation that preserves gauge invariance we are guaranteed that all 
these different definitions are equivalent. 

Here Vpare is what is obtained from computing the Feynman diagrams including, 
for example, the 1-loop corrections at the lowest non trivial order (Vpare is defined 


Fig. 4.7 Diagrams 2 
contributing to the 1PI 

3-gluon vertex at the one-loop 

approximation level T T +... 
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as the scalar function multiplying the vertex tensor, normalised in such a way that it 
coincides with eso in lowest order). Vbare contains the cut-off K but does not know 
about u. Z is a factor that depends both on the cut-off and on u but not on momenta. 
Because of infrared singularities the defining scale u cannot vanish. The negative 
value —4? < 0 is chosen to stay away from physical cuts (a gluon with negative 
virtual mass cannot decay). Similarly, in the massless theory, we can define zc as 
the inverse gluon propagator (the 1PI 2-point function) at the same scale — u? (the 
vanishing mass of the gluon is guaranteed by gauge invariance). 
After computing all 1-loop diagrams indicated in Fig. 4.7 we have: 


2.2 2 K? = 
Vbare( p“, p^, p^) = eu Li reote = log m Tel = 


2 2 


= [1+cas-log—; + ..leos[1 + cos -log —5- +...] 
=u p 
p? 
= Zy'eos[1 + cos - log E +..] 
K2 = 
= [1+da,; - log z E .. Jes[1 + cas - log 5 T... 
-u p 
m (4.17) 


Note the replacement of eo with e in the second step, compensated by changing 
c into d in the first bracket (corresponding to eo = Zg ag ve). The definition 
of e, demands that one precisely specifies what is included in Z. For this, in a 
given renormalisation scheme, a prescription is fixed to specify the finite terms 
that go into Z (i.e. the terms of order œs that accompany log K?). Then V,en is 
specified and the renormalised coupling is defined from it according to Eq. (4.16). 
For example, in the momentum subtraction scheme we define V,en( ae p p^) = 
es + Viare p p’, p^) Voare( H^ ys p) which is equivalent to say, at 
1-loop, that all finite terms that do not vanish at p? = — u? are included in Z. 

A crucial observation is that V54,, depends on K but not on u, which is only 
introduced when Z, V;e, and hence o; are defined. (From here on, for shorthand, 
we write o to indicate either the QED coupling or the QCD coupling œs). More in 
general for a generic Green function G, we similarly have: 


Gpare(K”, a0, pP) = ZGGren(u, a, p”) (4.18) 


so that we have: 


= [ZGGren] = 0 (4.19) 
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or 


ð da 0 1 9Zg 
—————— —— + — 
8log u? 8log u? da Zg 0 log u? 


Zol ]Gren —-0 (4.20) 


Finally the renormalisation group equation (RGE) can be written as: 


ð 0 
[—— + B(@)— + ye(@)|Gren = 0 (4.21) 
ð log jt da 
where 
(a (4.22) 
VERE 8 log u? f 
and 
ð log Z 
yola) = Efe (4.23) 
0 logu 


Note that £ (o) does not depend on which Green function G we are considering, but 
it is a property of the theory and the renormalisation scheme adopted, while yc (o) 
also depends on G. Strictly speaking the RGE as written above is only valid in the 
Landau gauge (A = 0). In other gauges an additional term that takes the variation 
of the gauge fixing parameter à should also be included. We omit this term, for 
simplicity, as it is not relevant at the 1-loop level. 

Assume that we want to apply the RGE to some hard process at a large scale 
Q, related to a Green function G that we can always take as dimensionless (by 
multiplication by a suitable power of Q). Since the interesting dependence on Q 
will be logarithmic we introduce the variable t as : 


2 
t = lic (4.24) 
H 


Then we can write Gen = F(t, o, xj) where x; are scaling variables (we often omit 
to write them in the following). In the naive scaling limit F should be independent 
of t. To find the actual dependence on t, we want to solve the RGE 


[i 9 
USE) yel = 0 (4.25) 


with a given boundary condition at t = 0 (or Q? = u?): F(0, o). 

We first solve the RGE in the simplest case that yc; (o) = 0. This is not an 
unphysical case: for example, it applies to R,+,- where the vanishing of y is related 
to the non renormalisation of the electric charge in QCD (otherwise the proton and 
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the electron charge would not exactly compensate: this will be better explained in 
Sect. 4.5). So we consider the equation: 


3 ð 
E s B(a) 3 1Gren =0 (4.26) 
The solution is simply 
F(t,a) = F|0,a(t)] (4.27) 


where the “running coupling” a(t) is defined by: 


a(t) 1 
t= f —— da' (4.28) 
a PW) 


Note that from this definition it follows that æ(0) = o, so that the boundary 
condition is also satisfied. To prove that F[0, a(t)] is indeed the solution, we first 
take derivatives with respect of t and o (the two independent variables) of both sides 
of Eq. (4.28). By taking d/dt we obtain 


1 a 
= wo (4.29) 
B(o(t) 8t 
We then take d /do and obtain 
1 1 datt 
0 = 4: 2 (4.30) 


— B@) ^ Balt) da 


These two relations make explicit the dependence of the running coupling on t and 
a: 


da(t) _ 

<= Blatt) in 
da(t) " B(a(t)) (4.32) 
ða B(a) 


Using these two equations one immediately checks that F[0, a(t)] is indeed the 
solution. 

Similarly, one finds that the solution of the more general equation with y Æ 0, 
Eq. (4.25), is given by: 


a(t) / 
F(t,a) = FIO. a(r)lexp f VO) ay (4.33) 


a Ba’) 
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In fact the sum of the two derivatives acting on the factor F[0, w(t)] vanishes and the 
exponential is by itself a solution of the complete equation. Note that the boundary 
condition is also satisfied. 

The important point is the appearance of the running coupling that determines the 
asymptotic departures from scaling. The next step is to study the functional form of 
the running coupling. From Eq. (4.31) we see that the rate of change with t of the 
running coupling is determined by the £ function. In turn £(o) is determined by 
the u dependence of the renormalised coupling through Eq. (4.22). Clearly there 
is no dependence on u of the basic 3-gluon vertex in lowest order (order e). The 
dependence starts at 1-loop, that is at order e? (one extra gluon has to be emitted 
and reabsorbed). Thus we obtain that in perturbation theory: 


ð 
— —À, «e (4.34) 
ð log u? 
Recalling that œ = e*/4z, we have: 
ð 
l ar ouo (4.35) 
3 log u? 3 log u? 
Thus the behaviour of B (a) in perturbation theory is as follows: 
ba) = +bo7[1 + ba 4 ...] (4.36) 


Since the sign of the leading term is crucial in the following discussion, we stipulate 
that always b > O and we make the sign explicit in front. 

Let us make the procedure for computing the 1-loop beta function in QCD (or, 
similarly, in QED) more precise. The result of the 11oop 1PI diagrams for V,-¢, can 
be written down as (we denote e, and o; by e and a, for shorthand): 


2 


Viren = e[1 + a Bag log —, +...] (4.37) 
=P 
V;en satisfies the RGE: 
[ ° + pate : (@)|Vren = 0 (4.38) 
ð log u? d c E . 


With respect to Eq. (4.21) the beta function term has been rewritten taking into 
account that V,en starts with e and the anomalous dimension term arises from 
a factor Z, 1/2 for each gluon leg. In general for a n-leg 1PI Green function 


Vi.bare = a i Va,ren, if all external legs are gluons. Note that in the particular case 
of V = V3 that is used to define e other Z factors are absorbed in the replacement 
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Z! Z3 eg = e. At 1-loop accuracy we replace B(a) = —bo? and yg(a) = yd a. 


All together one obtains: 


3 

b = 2(B3g — 2/42 (4.39) 
Similarly we can write the diagrammatic expression and the RGE for the 1PI 2- 
gluon Green function which is the inverse gluon propagator II (a scalar function 
after removing the gauge invariant tensor): 


2. 
Hren = [1 + aBa log ^5 «I (4.40) 


and 


ð 
+ B(a)— — Yeo) Tren = 0 (4.41) 


l3 log u? da 


Notice that the normalisation and the phase of II are specified by the lowest order 
term being one. In this case the f function term is negligible being of order o? 
(because IT is a function of e only through o). and we obtain: 


y = Bog (4.42) 


Thus, finally: 
3 
b = 2(B3, — 5 P») (4.43) 


By direct calculation at 1-loop one finds: 
Nc Q? 
QED: Bla) ~ +ba? +..... b = ya (4.44) 


where Nc = 3 for quarks and Nc = 1 for leptons and the sum runs over all fermions 
of charge Q;e that are coupled. Also, one finds: 


11Nc — 2n; 


QCD: Bla) ~ —bo? +..... b= 
127 


(4.45) 


where, as usual, n f is the number of coupled flavours of quarks (we assume here 
thatnf < 16 so that b > 0 in QCD). If o'(r) is small we can compute B(a(t)) in 
perturbation theory. The sign in front of b then decides the slope of the coupling: 
a(t) increases with t (or Q?^) if P is positive at small œ (QED), or w(t) decreases with 


t (or Q?) if P is negative at small a (QCD). A theory like QCD where the running 
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coupling vanishes asymptotically at large Q? is called (ultraviolet) *asymptotically 
free". An important result that has been proven is that in four spacetime dimensions 
all and only non-abelian gauge theories are asymptotically free. 

Going back to Eq. (4.28) we replace (a) ^ + bo, do the integral and perform 
a simple algebra. We find 


QED: aH ~ — (4.46) 
] — bat 
and 
QCD : (f) E (4.47) 
ZEN: 14 bat 


A slightly different form is often used in QCD. Defining 1/y = blog u*/Adcp 
we can write: 


1 1 1 
a(t) ~ = —— s = ———— (4.48) 
+ + bt blog $— + blog € blog ie 
QCD H QCD 


We see that a(t) decreases logarithmically with Q? and that one can introduce a 
dimensional parameter A oc p that replaces u. Often in the following we will simply 
write A for A gc p. Note that it is clear that A depends on the particular definition of 
a, not only on the defining scale u but also on the renormalisation scheme (see, for 
example, the discussion in the next session). Through the parameter b, and in general 
through the £ function, it also depends on the number n f of coupled flavours. It is 
very important to note that QED and QCD are theories with “decoupling”: up to the 
scale Q only quarks with masses m << Q contribute to the running of o. This is 
clearly very important, given that all applications of perturbative QCD so far apply 
to energies below the top quark mass rn;. For the validity of the decoupling theorem 
[11] it is necessary that the theory where all the heavy particle internal lines are 
eliminated is still renormalisable and that the coupling constants do not vary with 
the mass. These requirements are true for the mass of heavy quarks in QED and 
QCD, but are not true in the electroweak theory where the elimination of the top 
would violate SU (2) symmetry (because the t and b left quarks are in a doublet) and 
the quark couplings to the Higgs multiplet (hence to the longitudinal gauge bosons) 
are proportional to the mass. In conclusion, in QED and QCD, quarks with m >> Q 
do not contribute to n in the coefficients of the relevant 6 function. The effects of 
heavy quarks are power suppressed and can be taken separately into account. For 
example, in ete~ annihilation for 2m, < Q < 2m; the relevant asymptotics is for 
ny = 4, while for 2mp < Q < 2m; ny = 5. Going accross the b threshold the £ 
function coefficients change, so the a(t) slope changes. But a(t) is continuous, so 
that A changes so as to keep constant a(f) at the matching point at Q ~ o(2mp). 
The effect on A is large: approximately A5 ^ 0.65A4. 
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Note the presence of a pole in Eqs. (4.46, 4.47) at tbat = 1, called the Landau 
pole, who realised its existence in QED already in the ‘50’s. For u ~ me (in QED) 
the pole occurs beyond the Planck mass. In QCD the Landau pole is located for 
negative t or at Q < u in the region of light hadron masses. Clearly the issue of the 
definition and the behaviour of the physical coupling (which is always finite, when 
defined in terms of some physical process) in the region around the perturbative 
Landau pole is a problem that lies outside the domain of perturbative QCD. 

The non leading terms in the asymptotic behaviour of the running coupling can in 
principle be evaluated going back to Eq. (4.36) and computing b’ at 2-loops and so 
on. But in general the perturbative coefficients of 6(@) depend on the definition of 
the renormalised coupling o (the renormalisation scheme), so one wonders whether 
it is worthwhile to do a complicated calculation to get b’ if then it must be repeated 
for a different definition or scheme. In this respect it is interesting to remark that 
actually both b and b’ are independent of the definition of o, while higher order 
coefficients do depend on that. Here is the simple proof. Two different perturbative 
definitions of a are related by a’ ^ a(1 + cya + ...). Then we have: 


Bia’) d E eea 
[04 SSS = ee [04 "T 
d log u? d log u? m 


= cbo?^(l + b'a + ..)0 + 2c19 + ...) 
= +ba?(1 + b'a! +...) (4.49) 


which shows that, up to the first subleading order, B(a’) has the same form as f (a). 
In QCD (Nc = 3) one has calculated: 


, _ 1053- I9n; (4.50) 
«— 2n (33 — 2ny) l 


By taking b’ into account one can write the expression of the running coupling at 
next to the leading order (NLO): 


2 
a(Q^ = oro(Q5[1 — M'aLo (Q^) loglog S; Ee (4.51) 


where or — blog Q?/ A? is the LO result (actually at NLO the definition of A is 
modified according to b log u?/ A? = 1/a + b' log ba). 

Summarizing, we started from massless classical QCD which is scale invariant. 
But we have seen that the procedure of quantisation, regularisation and renormal- 
isation necessarily breaks scale invariance. In the quantum QCD theory there is 
a scale of energy, A, which from experiment is of the order of a few hundred 
MeV, its precise value depending on the definition, as we shall see in detail. 
Dimensionless quantities depend on the energy scale through the running coupling 
which is a logarithmic function of Q?/ A?. In QCD the running coupling decreases 
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logarithmically at large Q? (asymptotic freedom), while in QED the coupling has 
the opposite behaviour. 


4.4 More on the Running Coupling 


In the previous section we have introduced the renormalised coupling o in terms of 
the 3-gluon vertex at p? = —u? (momentum subtraction). The Ward identities of 
QCD then ensure that the coupling defined from other vertices like the ggg vertex 
are renormalised in the same way and the finite radiative corrections are related. 
But at present the universally adopted definition of œs is in terms of dimensional 
regularisation because of computational simplicity which is essential given the great 
complexity of present day calculations. So we now briefly review the principles 
of dimensional regularisation and the definition of Minimal Subtraction (M S) and 
Modified Minimal Subtraction (WS). The MS definition of o; is the one most 
commonly adopted in the literature and a value quoted for it is nomally referring 
to this definition. 

Dimensional Regularisation (DR) is a gauge and Lorentz invariant regularisation 
that consists in formulating the theory in D < 4 spacetime dimensions in order to 
make loop integrals ultraviolet finite. In DR one rewrites the theory in D dimensions 
(D is integer at the beginning, but then we will see that the expression of diagrams 
makes sense at all D except for isolated singularities). The metric tensor is extended 
into a D x D matrix g,, = diag(1, —1, —1,...., —1) and 4-vectors are given by 
k^" = (K0,kl,... , kP-!). The Dirac y" are f(D) x f (D) matrices and it is not 
important what is the precise form of the function f(D). It is sufficient to extend 
the usual algebra in a straightforward way like (yy, Yv} = 28,,v1, with J the D- 
dimensional identity matrix, y^y"y, = —(D—2)y"orTr(y"y") = f(D)gyv. 

The physical dimensions of fields change in D dimensions and, as a consequence, 
the gauge couplings become dimensional ep = Se, where e is dimensionless, 
D = 4—2eand u is a scale of mass (this is how a scale of mass is introduced in 
the DR of massless QCD!). The dimension of fields is determined by requiring that 
the action S = fd D y f; is dimensionless. By inserting for £ terms like mWW or 
mo'¢ or eV y WA y, the dimensions of the fields and coupling are determined as: 
m,V,o,A,,e = 1,(D—1)/2, (D — 2)/2, (D — 2)/2, (4 — D)/2, respectively. 
The formal expression of loop integrals can be written for any D. For example: 


dPk 1 rQ- D/2)(—m2)P/2-2 
Qm) (m2 (mn (4.52) 


For D = 4 — 2e one can expand using: 


1 
r(e) = — — yg + of), yg = 0.5772..... (4.53) 
€ 
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For some Green function G, normalised to one in lowest order, (like V/e with V the 
3-g vertex function at the symmetric point p? — q? — r?, considered in the previous 
section) we typically find at 1-loop: 


2 
= 1 
Goare = 1 + (CX [BG +log4r — yr) + A + eO) — (454 


In MS one rewrites this at 1-loop accuracy (diagram by diagram: this is a virtue of 
the method): 


Gbare = ZGren 


1 
Z=lt+a LBC + log4z — yg)] 


=p? 


Gren = 1 + a [Blog 3 
p 


+A] (4.55) 


Here Z stands for the relevant product of renormalisation factors. In the original M S 
prescription only 1/e was subtracted (that clearly plays the role of a cutoff) and not 
also log 4z and yz. Later, since these constants always appear from the expansion 
of T functions it was decided to modify MS into M S. Note that the M S definition 
of æ is different than that in the momentum subtraction scheme because the finite 
terms (those beyond logs) are different. In particular here ôG,en does not vanish at 
fae. 

The third [12] and fourth [13] coefficients of the QCD f function are also 
known in the M S prescription (recall that only the first two coefficients are scheme 
independent). The calculation of the last term involved the evaluation of some 
50,000 4-loop diagrams. Translated in numbers, for n f = 5 one obtains : 


Bla) = —0.61002[1 + 1.261...— + 1.475...(—)? + 9.836... (—)...] 
JT m H 
(4.56) 


It is interesting to remark that the expansion coefficients are all of order 1 or (10 for 
the last one), so that the MS expansion looks reasonably well behaved. 

It is important to keep in mind that the QED and QCD perturbative series, 
after renormalisation, have all their coefficients finite, but the expansion does not 
converge. Actually the perturbative series are not even Borel summable. After Borel 
resummation for a given process one is left with a result which is ambiguous 
by terms typically down by exp —n/(bo), with n an integer and b the first £ 
function coefficient. In QED these corrective terms are extremely small and not very 
important in practice. On the contrary in QCD a = o,(Q?) ~ 1/(blog Q?/A?) 
and the ambiguous terms are of order (1/ Q?)", that is are power suppressed. It is 
interesting that, through this mechanism, the perturbative version of the theory is 
able to somehow take into account the power suppressed corrections. A sequence 
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of diagrams with factorial growth at large order n is made up by dressing gluon 
propagators by any number of quark bubbles together with their gauge completions 
(renormalons).The problem of the precise relation between the ambiguities of the 
perturbative expansion and the higher twist corrections has been discussed in recent 
years [14]. 


4.5 Application to Hard Processes 


4.5.1  R,«,- and Related Processes 


The simplest hard process is R,+,-- that we have already started to discuss. R is 
dimensionless and in perturbation theory is given by R = Nc}; Q?F (t, os), 
where F = 1 + (as). We have already mentioned that for this process the 
“anomalous dimension" function vanishes: y (œs) = 0 because of electric charge 
non renormalisation by strong interactions. Let us review how this happens in detail. 
The diagrams that are relevant for charge renormalisation in QED at 1-loop are 
shown in Fig. 4.8. The Ward identity that follows from gauge invariance in QED 
imposes that the vertex (Zy) and the self-energy (Z f) renormalisation factors cancel 
and the only divergence remains in Z,, the vacuum polarization of the photon. So 
the charge is only renormalised by the photon blob, hence it is universal (the same 
factor for all fermions, independent of their charge) and is not affected by QCD 
at 1-loop. It is true that at higher orders the photon vacuum polarization diagram 
is affected by QCD (for example, at 2-loops we can exchange a gluon between 
the quarks in the photon loop) but the renormalisation induced by the vacuum 
polarisation diagram remains independent of the nature of the fermion to which 
the photon line is attached. The gluon contributions to the vertex (Zy) and to the 


f f f 
+ + + 
Y B" Y Jy 7 i4 B 
f f f 
Zy Zy 


-l 
| 


Zi Zi 


Fig. 4.8 Diagrams for charge renormalisation in QED at 1-loop 
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Fig. 4.9 Real and virtual diagrams relevant for the computation of R at 1-loop accuracy 


self-energy (Z f) cancel because they have exactly the same structure as in QED, so 
that y (o,) = 0. 

At 1-loop the diagrams relevant for the computation of R are shown in Fig. 4.9. 
There are virtual diagrams and real diagrams with one additional gluon in the 
final state. Infrared divergences cancel between the interference term of the virtual 
diagrams and the absolute square of the real diagrams, according to the Bloch- 
Nordsieck theorem. Similarly there are no mass singularities, in agreement with 
the Kinoshita-Lee-Nauenberg theorem, because the initial state is purely leptonic 
and all degenerate states that can appear at the given order are included in the final 
state. Given that y (œs) = 0 the RGE prediction is simply given, as we have already 
seen, by F(t,a;) = F[0, æs(t)]. This means that if we do, for example, a 2-loop 
calculation, we must obtain a result of the form: 


F(t,as) = 1 + cios (1 — bast) + coa? + o(o2) (4.57) 


In fact we see that this form, taking into account that from Eq. (4.47) we have: 


a(t) ~ ~a (l — bast + ....) (4.58) 


Qs 
1+ bast 
can be rewritten as 

F(t, as) = 1 + cras(t) + ca? (t) + o(@3(t)) = FIO, as(t)] (4.59) 


The content of the RGE prediction is, at this order, that there are no ast and (ast)? 
terms (the leading log sequence must be absent) and the term of order at has the 
coefficient that allows to reabsorb it in the transformation of œs into o, (t). 

At present the first three coefficients have been computed in the MS scheme 
[15]. Clearly c1 = 1/z does not depend on the definition of œs but c» and c3 do. 
The subleading coefficients also depend on the scale choice: if instead of expanding 
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in œs (Q) we decide to choose o; ( Q/2) the coefficients c2 and c3 change. In the MS 
scheme, for y exchange and n f = 5, which are good approximations for 2mp << 
Q «« mz, one has: 


t t t 

F[0,o,(0] = 1-4 EË 4 1.409... (5002. 12.8... 3 4... (4.60) 
T T T 

Similar perturbative results at 3-loop accuracy also exist for Rz = T'(Z — 


hadrons)/ V(Z — leptons), Rr = V(r > v-Fhadrons)/ F (x — v, +leptons), 
etc. We will discuss these results later when we deal with measurements of ay. 

The perturbative expansion in powers of a, (t) takes into account all contributions 
that are suppressed by powers of logarithms of the large scale Q? ("leading twist" 
terms). In addition there are corrections suppressed by powers of the large scale 
Q (“higher twist” terms). The pattern of power corrections is controlled by the 
light-cone Operator Product Expansion (OPE) [16] which (schematically) leads to: 


m2 < O|Tr[F,,, F""]|0 > < 0/06|0 > 
F = pert. + mol T rn; EE P exe E LET. NN 


(4.61) 


Here m? generically indicates mass corrections, notably from b quarks, for example 
(t quark mass corrections only arise from loops, vanish in the limit m; — oo and 
are included in the coefficients as those in Eq. (4.60) and the analogous ones for 
higher twist terms), Fay = )°, F nds Og is typically a 4-fermion operator, etc. 
For each possible gauge invariant operator the corresponding power of Q? is fixed 
by dimensions. 

We now consider the light-cone OPE in some more detail. R:«,- ~ II(Q?) 
where (Q?) is the scalar spectral function related to the hadronic contribution to 
the imaginary part of the photon vacuum polarization Ty»: 


Tuv 


CQ? + quad) = f expigx < OIOI > dx = 


= X < 0J} On >< n|J (00 > (27)48t (q — pn) (4.62) 


n 


For Q? — oo the x? — 0 region is dominant. To all orders in perturbation theory 
the OPE can be proven. Schematically, dropping Lorentz indices, for simplicity, 
near x? ^ 0 we have: 


HIO = 162) + EG!) Y naa xh ON, O 
n=0 


+ less sing. terms (4.63) 
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Here I(x”), E(x?),. ere CR (x?) are c-number singular functions, O” is a string of 
local operators. E (x2) is the singularity of free field theory, (x?) and c; (x2) 
contain powers of log 2 x? in interaction. Some O” are already present in free field 
theory, other ones appear when interactions are switched on. Given that TI (Q?) is 
related to the Fourier transform of the vacuum expectation value of the product 
of currents, less singular terms in x? lead to power suppressed terms in 1/ Q?. The 
perturbative terms come from J (x?) which is the leading twist term. The logarithmic 
scaling violations induced by the running coupling are the logs in I (x?). 


4.5.2 The Final State in et e- Annihilation 


Experiments on e*e^ annihilation at high energy provide a remarkable possibility 
of systematically testing the distinct signatures predicted by QCD for the structure of 
the final state averaged over a large number of events. Typical of asymptotic freedom 
is the hierarchy of configurations emerging as a consequence of the smallness of 
Os (Q?). When all corrections of order a, (Q?) are neglected one recovers the naive 
parton model prediction for the final state: almost collinear events with two back- 
to-back jets with limited transverse momentum and an angular distribution as (1 + 
cos? 0) with respect to the beam axis (typical of spin 1/2 parton quarks: scalar quarks 
would lead to a sin? 0 distribution). At order o, (Q?) a tail of events is predicted 
to appear with large transverse momentum pr ^ Q/2 with respect to the thrust 
axis (the axis that maximizes the sum of the absolute values of the longitudinal 
momenta of the final state particles). This small fraction of events with large pr 
mostly consists of three-jet events with an almost planar topology. The skeleton of a 
three-jet event, at leading order in o; ( Q?), is formed by three hard partons q@g, the 
third being a gluon emitted by a quark or antiquark line. The distribution of three-jet 
events is given by: 


21.2 
LC EE NES S NN (4.64) 
o dx1d xa 3n (1 — x1)(1 — x2) 


here x1? refer to energy fractions of massless quarks: x; = 2E;//s with x1 + x2 + 
x3 = 2. At order o2 (Q?) a hard perturbative non planar component starts to build 
up and a small fraction of four-jet events gq gg or qqqq appear, and so on. 

A quantitatively specified definition of jet counting must be introduced for 
precise QCD tests and for measuring œs, which must be infrared safe (i.e. not altered 
by soft particle emission or collinear splittings of massless particles) in order to be 
computable at parton level and as much as possible insensitive to the transformation 
of partons into hadrons. One introduces a resolution parameter y,,; and a suitable 
pair variable; for example [17]: 


min(E?, E?) — cos ĝ;j) 
yj = ——————————— (4.65) 


S 
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The particles i,j belong to different jets for yj; > yc. Clearly the number of jets 
becomes a function of y;,;: there are more jets for smaller y,,;. Measurements of 
as (Q?) have been performed starting from jet multiplicities, the largest error coming 
from the necessity of correcting for non-perturbative hadronisation effects. 


4.5.3 Deep Inelastic Scattering 


Deep Inelastic Scattering (DIS) processes have played and still play a very important 
role for our understanding of QCD and of nucleon structure. This set of processes 
actually provides us with a rich laboratory for theory and experiment. There are 
several structure functions that can be studied, F; (x, Q?), each a function of two 
variables. This is true separately for different beams and targets and different 
polarizations. Depending on the charges of | and I’ (see Eq. (4.13)) we can have 
neutral currents (y,Z) or charged currents in the 1- channel (Fig. 4.6). In the past 
DIS processes were crucial for establishing QCD as the theory of strong interactions 
and quarks and gluons as the QCD partons. At present DIS remains very important 
for quantitative studies and tests of QCD. The theory of scaling violations for totally 
inclusive DIS structure functions, based on operator expansion or diagrammatic 
techniques and renormalisation group methods, is crystal clear and the predicted 
Q* dependence can be tested at each value of x. The measurement of quark and 
gluon densities in the nucleon, as functions of x at some reference value of Q?, 
which is an essential starting point for the calculation of all relevant hadronic hard 
processes, is performed in DIS processes. At the same time one measures o, (Q?) 
and the DIS values of the running coupling can be compared with those obtained 
from other processes. At all times new theoretical challenges arise from the study of 
DIS processes. Recent examples are the so-called "spin crisis" in polarized DIS and 
the behaviour of singlet structure functions at small x as revealed by HERA data. In 
the following we will review the past successes and the present open problems in 
the physics of DIS. 

The cross-section o ~ L""W,, is given in terms of the product of a leptonic 
(L") and a hadronic (W,,,) tensor. While L^" is simple and easily obtained 
from the lowest order electroweak (EW) vertex plus QED radiative corrections, 
the complicated strong interaction dynamics is contained in W,,,. The latter is 
proportional to the Fourier transform of the forward matrix element between the 
nucleon target states of the product of two EW currents: 


Wu = fa expiqx < PIT E(x) Jy (O)|p > (4.66) 
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Structure functions are defined starting from the general form of W,, given 
Lorentz invariance and current conservation. For example, for EW currents between 
unpolarized nucleons we have: 


v W : 2 
du ) Wiw, Q?) + (p, — 400» = PD mee ~ 


Ww = (—8uv + 2 
q 


m 


l 
= o3 6o P^ q^ Wa(v, Q?) 


W3 arises from VA interference and is absent for pure vector currents. In the limit 
Q? >> m’, x fixed, the structure functions obey approximate Bjorken scaling 
which in reality is broken by logarithmic corrections that can be computed in QCD: 
mW (v, Q^) > Fi(x) 
vWaa(v, O°) > Fa,3(x) (4.67) 


The y — N cross-section is given by (W; = W;(Q?, v)): 


do’ 4ra? E' Ds 20 a 29 wy (4.68) 
———— = ———— .[2sin^- = : 
d Qidv Q^E gi dove unge 


while for the v — N or v — N cross-section one has: 


d UA GA E' 2 0 0 E+E 0 
a = ea > - [2 sin? ZW; + cos? -W3 + Be sin? —W3] 
d Q*dv 2nE Q?-c mi, 2 2 m 2 


(4.69) 


(W; for photons, v and v are all different, as we shall see in a moment). 
In the scaling limit the longitudinal and transverse cross sections are given by: 


1 
ORH,LH ^* T) + F3(x)] 


OT = ORH + OLH (4.70) 


where L, RH, LH refer to the helicity 0, 1, — 1, respectively, of the exchanged gauge 
vector boson. 

In the *60's the demise of hadrons from the status of fundamental particles to that 
of bound states of constituent quarks was the breakthrough that made possible the 
construction of a renormalisable field theory for strong interactions. The presence 
of an unlimited number of hadrons species, many of them with large spin values, 
presented an obvious dead-end for a manageable field theory. The evidence for 
constituent quarks emerged clearly from the systematics of hadron spectroscopy. 
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The complications of the hadron spectrum could be explained in terms of the 
quantum numbers of spin 1/2, fractionally charged, u, d and s quarks. The notion 
of colour was introduced to reconcile the observed spectrum with Fermi statistics. 
But confinement that forbids the observation of free quarks was a clear obstacle 
towards the acceptance of quarks as real constituents and not just as fictitious 
entities describing some mathematical pattern (a doubt expressed even by Gell- 
Mann at the time). The early measurements at SLAC of DIS dissipated all doubts: 
the observation of Bjorken scaling and the success of the “naive” (not so much after 
all) parton model of Feynman imposed quarks as the basic fields for describing the 
nucleon structure (parton quarks). 

In the language of Bjorken and Feynman the virtual y (or, in general, any 
gauge boson) sees the quark partons inside the nucleon target as quasi-free, because 
their (Lorentz dilated) QCD interaction time is much longer than ty ~ 1/Q, the 
duration of the virtual photon interaction. Since the virtual photon 4-momentum 
is spacelike, we can go to a Lorentz frame where Ey, = O (Breit frame). In this 
frame q = (Ey = 0;0,0, Q) and the nucleon momentum, neglecting the mass 
m << Q,is p = (Q/2x; 0,0, — Q/2x) (note that this correctly gives q? = —Q? 
and x = Q?/2(p-q)). Consider (Fig. 4.10) the interaction of the photon with a quark 
carrying a fraction y of the nucleon 4-momentum: p; — yp (we are neglecting 
the transverse components of p; which are of order m). The incoming parton with 
Pq = yp absorbs the photon and the final parton has 4-momentum By Since in the 
Breit frame the photon carries no energy but only a longitudinal momentum Q, the 
photon can only be absorbed by those partons with y — x: then the longitudinal 
component of p; = yp is —yQ/2x = —Q/2 and can be flipped into +Q/2 by 
the photon. As a result, the photon longitudinal momentum +Q disappears, the 
parton quark momentum changes of sign from — Q/2 into + Q/2 and the energy is 
not changed. So the structure functions are proportional to the density of partons 
with fraction x of the nucleon momentum, weighted with the squared charge. Also, 
recall that the helicity of a massless quark is conserved in a vector (or axial vector) 
interaction. So when the momentum is reversed also the spin must flip. Since the 
process is collinear there is no orbital contribution and only a photon with helicity 
+1 (transverse photon) can be absorbed. Alternatively, if partons were spin zero 
only longitudinal photons would instead contribute. 

Using these results, which are maintained in QCD at leading order, the quantum 
numbers of the quarks were confirmed by early experiments. The observation that 
R = o,/or — 0 implies that the charged partons have spin 1/2. The quark charges 


Fig. 4.10 Schematic diagram —Q/2 

for the interaction of the 

virtual photon with a parton => Q 
quark in the Breit frame 


= +02 spin 
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were derived from the data on the electron and neutrino structure functions: 


Fep = Af9u(x) + 1/9d(x) + ....; Fen = 4/9d(x) + 1/9u(x) +... 
Fop = Fin = 2d(x) + .....; Fin = Fop = 2u(x) + ... (4.71) 


where F ^ 2F| ^ Fo/x and u(x), d(x) are the parton number densities in the 
proton (with fraction x of the proton longitudinal momentum), which, in the scaling 
limit, do not depend on Q?. The normalisation of the structure functions and the 
parton densities are such that the charge relations hold: 


1 1 1 
j! [u(x) — u(x)]dx = 2, f [d(x) — d(x)]dx zd, f [s(x) — s(x)]dx = 0 
0 0 0 
(4.72) 


Also it was proven by experiment that at values of Q? of a few GeV’, in the scaling 
region, about half of the nucleon momentum, given by the momentum sum rule: 


1 
f [3 aia E) + gG)lxdx = 1 (4.73) 


is carried by neutral partons (gluons). 
In QCD there are calculable log scaling violations induced by o; (t). The parton 
rules just introduced can be summarised in the formula: 


1 


D (4.74) 


1 
F(x,t) = f dy 9 a, i = (0) + o( 


Before QCD corrections Opoint = e75(x/y — l) and F = e?qo (x) (here we denote 
by e the charge of the quark in units of the positron charge, i.e. e — 2/3 for the 
u quark). QCD modifies Opoint at order a; via the diagrams of Fig. 4.11. Note that 
the integral is from x to 1, because the energy can only be lost by radiation before 
interacting with the photon (which eventually wants to find a fraction x, as we have 


yt ME I dif Xx g 2 
+ : MEETS + *o 
g 
g 
(a) (b) (c) (d) 


Fig. 4.11 First order QCD corrections to the virtual photon-quark cross-section: (a) leading order 
with (b) one-loop virtual correction; (c-d) next-to-leading order real emission 
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explained). From a direct computation of the diagrams one obtains a result of the 
following form: 


Opoint(Z, os (f)) = e^[B(z — 1) + Ze PR) + fI (4.75) 


For y > x the correction arises from diagrams with real gluon emission. Only the 
sum of the two real diagrams in Fig. 4.11 is gauge invariant, so that the contribution 
of one given diagram is gauge dependent. There is a special form of axial gauge, 
called physical gauge, where, among real diagrams, the diagram of Fig. 4.1 1c gives 
the whole t-proportional term. It is obviously not essential to go to this gauge, but 
this diagram has a direct physical interpretation: a quark in the proton has a fraction 
y > x ofthe parent 4-momentum; it then radiates a gluon and looses energy down to 
a fraction x before interacting with the photon. The log arises from the virtual quark 
propagator, according to the discussion of collinear mass singularities in Eq. (4.12). 
In fact in the massless limit one has: 


Oo 1 MES 1 
r2  (k=h)}  2EŁEņp |-—cos0 
EI 1 =| 

4E,En sin?0/2 — pj 


propagator — 
(4.76) 


where pr is the transverse momentum of the virtual quark. So the square of the 
propagator goes like 1/ ph. But there is a p factor in the numerator, because in the 
collinear limit, when 0 — 0 and the initial and final quarks and the emitted gluon are 
all aligned, the quark helicity cannot flip (vector interaction) so that the gluon should 
carry helicity zero but a real gluon can only have +1 helicity. Thus the numerator 
vanishes as p? in the forward direction and the cross-section behaves as: 


E 2 2 
o~ J — dp} ~ logQ (4.71) 
PT 


Actually the log should be read as log Q?/m? because in the massless limit a 
genuine mass singularity appears. In fact the mass singularity connected with the 
initial quark line is not cancelled because we do not have the sum of all degenerate 
initial states, but only a single quark. But in correspondence to the initial quark we 
have the (bare) quark density go(y) that appear in the convolution integral. This is 
a non perturbative quantity that is determined by the nucleon wave function. So we 
can factorize the mass singularity in a redefinition of the quark density: we replace 
qo) > q(y,t) = go(y) + AqCy. t) with: 


1 
Aq, t) = sir | 4,293. pe) (4.78) 
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Here the factor of t is a bit symbolic: it stands for log Q?/ km? and what we exactly 
put below Q? depends on the definition of the renormalised quark density, which 
also fixes the exact form of the finite term f(z) in Eq. (4.75). 

The effective parton density q (y, t) that we have defined is now scale dependent. 
In terms of this scale dependent density we have the following relations, where we 
have also replaced the fixed coupling with the running coupling according to the 
prescription derived from the RGE: 


[04 


q(y,t) D f» = equ. t) + olst) 
T y 


1 
Fe.n- [ dy eo =) + 


1 
2 pce f p en Ca aet (4.79) 
dt 2x Jy y y 


We see that in lowest order we reproduce the naive parton model formulae for the 
structure functions in terms of effective parton densities that are scale dependent. 
The evolution equations for the parton densities are written down in terms of kernels 
(the “splitting functions”) that can be expanded in powers of the running coupling. 
At leading order, we can interpret the evolution equation by saying that the variation 
of the quark density at x is given by the convolution of the quark density at y times 
the probability of emitting a gluon with fraction x/y of the quark momentum. 

It is interesting that the integro-differential QCD evolution equation for densities 
can be transformed into an infinite set of ordinary differential equations for Mellin 
moments [2]. The moment f, of a density f(x) is defined as: 


1 
fa = Í dxx" f) (4.80) 
0 


By taking moments of both sides of the second of Eqs. (4.79) one finds, with 
a simple interchange of the integration order, the simpler equation for the n-th 
moment: 


< ant) = SO. p.i au (4.81) 
PEL o n' n i 
To solve this equation we observe that: 
t p, f P, (9550 dol 
pe Sf nodi = f 2 (4.82) 
dn (0) 2x Jo T Jo, —ba’ 


where we used Eq. (4.31) to change the integration variable from dt to da(t) 
(denoted as da’) and B(a) ~ —bo? +... Finally the solution is: 


as 


EET 95 - qn(O) (4.83) 


g(t) = I 
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The connection of these results with the RGE general formalism occurs via 
the light cone OPE (recall Eq. (4.66) for W,» and Eq. (4.63) for the OPE of two 
currents). In the case of DIS the c-number term Z (x?) does not contribute, because 
we are interested in the connected part < p|...|p > — < O|...|0 >. The relevant 
terms are: 


oo 
JOJO = EGO) b» ca (xP) xt! tn. O gk (0) + less sing. terms 
n=0 


(4.84) 


A formally intricate but conceptually simple argument (Ref. [6], page 28) based on 
the analiticity properties of the forward virtual Compton amplitude shows that the 
Mellin moments M, of structure functions are related to the individual terms in 
the OPE, precisely to the Fourier transform c. (Q?) (we will write it as c; (t, œ)) of 
the coefficient cn (x2) times a reduced matrix element An from the operators O”: 


« pO. is (0)|p >= hnpy, ..- Puy! 
1 
Cn < plO"|p »— Mn al dxx" ^ F(x) (4.85) 
0 


Since the matrix element of the products of currents satisfy the RGE so do the 
moments M,. Hence the general form of the Q? dependence is given by the RGE 
solution (see Eq. (4.33)): 


a(t) 


da! - hn(a) (4.86) 


M,(t,a) = a0 ato]exp f Ms 


In lowest order, identifying in the simplest case M, with qn, we have: 


ma) = ste d iss Bla) = -bæ +... (4.87) 
and 
Oya). 4 Og Ph 
qn (t) = qu(0) exp | Ba) dæ = To” : qn (0) (4.88) 


which exactly coincides with Eq. (4.83). 

Up to this point we have implicitly restricted our attention to non-singlet (under 
the flavour group) structure functions. The Q? evolution equations become non 
diagonal as soon as we take into account the presence of gluons in the target. In 
fact the quark which is seen by the photon can be generated by a gluon in the target 
(Fig. 4.12). 
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Fig. 4.12 Lowest order * 
diagram for the interaction of 

the virtual photon with a 

parton gluon q 


kej 


The quark evolution equation becomes: 


NO, NO, 
T [gi D Pag] + lg & Pye (4.89) 


: (x, t) 
—q; (x ; = 
d 

where we introduced the shorthand notation: 


X 


1 
[4 8 P] = [P 8q] = / 4,29: . pE) (4.90) 


(it is easy to check that the convolution, like an ordinary product, is commutative). 
At leading order, the interpretation of Eq. (4.89) is simply that the variation of the 
quark density is due to the convolution of the quark density at a higher energy times 
the probability of finding a quark in a quark (with the right energy fraction) plus 
the gluon density at a higher energy times the probability of finding a quark (of the 
given flavour 1) in a gluon. The evolution equation for the gluon density, needed to 
close the system, can be obtained by suitably extending the same line of reasoning 
to a gedanken probe sensitive to colour charges, for example a virtual gluon. The 
resulting equation is of the form: 


¥(t) - s) 
: LG + Fi) 9 Pa + Tg & Pee] (4.91) 


d t 

— Aj = k 

qp rg x 
The explicit form of the splitting functions in lowest order [18, 19] can be directly 

derived from the QCD vertices [19]. They are a property of the theory and do not 

depend on the particular process the parton density is taking part in. The results are: 


4 dex" 3 
Pgq = 3l -o7 + gon — x)] + o(as) 


414 (1—3 


fgg = 


+ o(as) 
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1 
Pag = gl + (1— xY?] + olas) 


x 1—x 33 — 2n; 
Peg Er IM T = Tox(1—x) + E 49-9 + o(as) (4.92) 


For a generic non singular weight function f (x), the “+” distribution is defined as: 


1 
| x fo -f fe- fO, (4.93) 
o (=x) = lax 


The 6(1 — x) terms arise from the virtual corrections to the lowest order tree 
diagrams. Their coefficient can be simply obtained by imposing the validity of 
charge and momentum sum rules. In fact, from the request that the charge sum 
rules in Eq. (4.72) are not affected by the Q? dependence one derives that 


1 
| Płq(x)dx =y (4.94) 
0 


which can be used to fix the coefficient of the 6(1 — x) terms of P4,. Similarly, 
by taking the t-derivative of the momentum sum rule in Eq. (4.73) and imposing its 
vanishing for generic g; and g, one obtains: 


1 1 
Í [Paa (x) + Peg(x)|xdx = 0, ri [2n f Pag(x) + Pgg(x)]xdx = 0. 
(4.95) 


At higher orders the evolution equations are easily generalised but the cal- 
culation of the splitting functions rapidly becomes very complicated. For many 
years the splitting functions were only completely known at NLO accuracy [20]: 
asP ~ asP) + a2 P» +.... Then in recent years the NNLO results P3 have been 
first derived in analytic form for the first few moments and, then the full NNLO 
analytic calculation, a really monumental work, was completed in 2004 by Moch, 
Vermaseren and Vogt [21]. Beyond leading order a precise definition of parton 
densities should be specified. One can take a physical definition (for example, quark 
densities can be defined as to keep the LO expression for the structure function Fz 
valid at all orders, the so called DIS definition [22], and the gluon density could 
be defined starting from Fz, the longitudinal structure function, or a more abstract 
specification (for example, in terms of the M S prescription). Once the definition of 
parton densities is fixed, the coefficients that relate the different structure functions 
to the parton densities at each fixed order can be computed. Similarly the higher 
order splitting functions also depend, to some extent, from the definition of parton 
densities, and a consistent set of coefficients and splitting functions must be used at 
each order. 

The scaling violations are clearly observed by experiment and their pattern is 
very well reproduced by QCD fits at NLO. Examples are seen in Fig. 4.13a-d [23]. 
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Fig. 4.13 A recent NLO fit of scaling violations from Ref. [23], for different x ranges, as functions 


of Q? 


4 QCD: The Theory of Strong Interactions 117 


These fits provide an impressive confirmation of a quantitative QCD prediction, a 
measurement of qi (x, QD and g(x, Q3) at some reference value QB of Q? anda 
precise measurement of o; (m2). 


4.5.3.1 Resummation for Deep Inelastic Structure Functions 


At small or at large values of x (with Q? large) those terms of higher order in a; 
in either the coefficients or the splitting functions which are multiplied by powers 
of log 1/x or log (1 — x) eventually become important and should be taken into 
account. Fortunately the sequences of leading and subleading logs can be evaluated 
at all orders by special techniques and resummed to all orders. 

For large x resummation [24] I refer to the recent papers [25, 26] (the latter also 
involving higher twist corrections, which are important at large x) where a list of 
references to previous work can be found. 

Here we will briefly summarise the small-x case for the singlet structure function 
which is the dominant channel at HERA, dominated by the sharp rise of the gluon 
and sea parton densities at small x. The small x data collected by HERA can 
be fitted reasonably well even at the smallest measured values of x by the NLO 
QCD evolution equations, so that there is no dramatic evidence in the data for 
departures. This is surprising also in view of the fact that the NNLO effects in 
the evolution have recently become available and are quite large. Resummation 
effects have been shown to resolve this apparent paradox. For the singlet splitting 
function the coefficients of all LO and NLO corrections of order [as (QO?) log 1/x]" 
and o; (Q?)[o, (Q?) log 1/x]", respectively, are explicitly known from the BFKL 
analysis of virtual gluon-virtual gluon scattering [27, 28]. But the simple addition of 
these higher order terms to the perturbative result (with subtraction of all double 
counting) does not lead to a converging expansion (the NLO logs completely 
overrule the LO logs in the relevant domain of x and Q?). A sensible expansion 
is only obtained by a proper treatment of momentum conservation constraints, 
also using the underlying symmetry of the BFKL kernel under exchange of the 
two external gluons, and especially, of the running coupling effects (see the recent 
papers [29, 30] and refs. therein). In Fig. 4.14 we present the results for the dominant 
singlet splitting function x P(x, v, (Q?)) for o;(Q?) ~ 0.2. We see that while the 
NNLO perturbative splitting function sharply deviates from the NLO approximation 
at small x, the resummed result only shows a moderate dip with respect to the NLO 
perturbative splitting function in the region of HERA data, and the full effect of the 
true small x asymptotics is only felt at much smaller values of x. The related effects 
are not very important for processes at the LHC but could become relevant for next 
generation hadron colliders. 
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Fig. 4.14 The dominant 05 
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4.5.3.2 Polarized Deep Inelastic Scattering 


In polarized DIS one main question is how the proton helicity is distributed among 
quarks, gluons and orbital angular momentum: 1/2AX + Ag+ L; = 1/2 (fora 
recent review, see, for example, [31]). For a parton density p (either a quark or a 
gluon) Ap indicates the first moment of the net helicity difference p} — p. ina 
polarized proton with helicity +1/2 or: 


1 
Ap(Q?) = [ dx[pa.G, Q?) — p-(x, Q3] (4.96) 


Experiments have shown that the quark moment AP is small (the “spin crisis"): 
values from a recent fit [32] are AXz;p ~ 0.21 + 0.14 and Age, ~ 0.50 1.27 
at Q? = | GeV? (see also [33]). This is surprising because AX is conserved in 
perturbation theory at LO (i.e. it does not evolve in Q?). For conserved quantities 
we would expect that they are the same for constituent and for parton quarks. But 
actually the conservation of AX is broken by the axial anomaly. In perturbation 
theory the conserved density is actually AX’ = AX + ny/2zo; Ag [34]. Note 
that also œs Ag is conserved in LO, that is Ag ~ log Q7. This behaviour is not 
controversial but it will take long before the log growth of Ag will be confirmed by 
experiment! But to establish this behaviour would show that the extraction of Ag 
from the data is correct and that the QCD evolution works as expected. If Ag is large 
enough it could account for the difference between partons (AX) and constituents ( 
A’). From the spin sum rule it is clear that the log increase should cancel between 
Ag and L.. This cancelation is automatic as a consequence of helicity conservation 
in the basic QCD vertices. From the spin sum rule one obtains that either Ag + Lz is 
large or there are contributions to AX at very small x outside of the measured region. 
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Ag can be measured indirectly by scaling violations and directly from asymmetries, 
e.g. in cc production. Existing measurements by Hermes, Compass, and at RHIC are 
still crude but show no hint of a large Ag at accessible value of x and Q?. Present 
data are consistent with Ag large enough to sizeably contribute to the spin sum rule 
but there is no indication that o; Ag can explain the difference between constituents 
and parton quarks. The perspectives of better measurements are good at Compass 
and RHIC in the near future. 


4.5.4 Factorisation and the QCD Improved Parton Model 


The parton densities defined and measured in DIS are instrumental to compute 
hard processes initiated by hadronic collisions via the Factorisation Theorem (FT). 
Suppose you have a hadronic process of the form A, + h2 — X + all where h; are 
hadrons and X is some triggering particle or pair of particles which specify the large 
scale Q? relevant for the process, in general somewhat, but not much, smaller than 
s, the total c.o.m. squared mass. For example, in pp or pp collisions, X can bea W 
or a Z or a virtual photon with large Q?, or a jet at large transverse momentum pr, 
or a pair of heavy quark-antiquark of mass M. By "all" we mean a totally inclusive 
collection of gluons and light quark pairs. The FT states that for the total cross- 
section or some other sufficiently inclusive distribution we can write, apart from 
power suppressed corrections, the expression: 


pi e x | dxidx pia Ga, Q?) pre (x2, O)oap Gixos, T) (4.97) 
AB 
Here t = Q?/s is a scaling variable, pic are the densities for a parton of type 


C inside the hadron hi, oap is the partonic cross-section for parton-A + parton- 
B— X + all'. This result is based on the fact that the mass singularities that are 
associated with the initial legs are of universal nature, so that one can reproduce 
the same modified parton densities, by absorbing these singularities into the bare 
parton densities, as in deep inelastic scattering. Once the parton densities and a; are 
known from other measurements, the prediction of the rate for a given hard process 
is obtained with not much ambiguity (e.g from scale dependence or hadronisation 
effects). The NLO calculation of the reduced partonic cross-section is needed in 
order to correctly specify the scale and in general the definition of the parton 
densities and of the running coupling in the leading term. The residual scale and 
scheme dependence is often the most important source of theoretical error. In the 
following we consider a few examples. 

A comparison of data and predictions on the production of jets at large ./s and 
Pr in pp or pp collisions is shown in Fig. 4.15 [9, 35]. 

This is a particularly significant test because the rates at different c.o.m. energies 
and, for each energy, at different values of pr span over many orders of magnitude. 
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Fig. 4.15 Jet production o* 
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This steep behaviour is determined by the sharp falling of the parton densities 
at large x. Also the corresponding values of ./s and pr are large enough to be 
well inside the perturbative region. The overall agreement of the data from ISR, 
UA1,2 and CDF and DO is spectacular. Similar results also hold for the production 
of photons at large pr. The collider data [36], shown in Fig.4.16 [9], are in 
fair agreement with the theoretical predictions. For the same process less clear a 
situation is found with fixed target data. Here, first of all, the experimental results 
show some internal discrepancies. Also, the pr accessible values being smaller, the 
theoretical uncertainties are larger. But it is true that the agreement is poor, so that 
the necessity of an "intrinsic" transverse momentum of partons inside the hadron 
of over 1 GeV has been claimed, which theoretically is not justified (rather, given 
the sharp falling down at large pr, it could be interpreted as a correction for pr 
calibration errors). 

For heavy quark production at colliders [42] the agreement is very good for the 
top crosssection at the Tevatron (Fig. 4.17) [43, 44]. The bottom production at the 
Tevatron has for some time represented a problem [45]. The total rate and the pr 
distribution of b quarks observed at CDF appeared in excess of the prediction, up 
to the largest measured values of pr. But this is a complicated case, with different 
scales being present at the same time: ~s, pr, mp. Finally the problem has been 
solved (Fig.4.18) by better taking into account a number of small effects from 
resummation of large logarithms, the difference between b hadrons and b partons, 
the inclusion of better fragmentation functions etc. [46]. 
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Fig. 4.16 Single photon 
production in pp colliders as 
function of pr [9] 
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Fig. 4.17 The t production 
cross-section at the Tevatron 
pp collider [44] 
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Drell-Yan processes, including lepton pair production via virtual y, W or Z 
exchange, offer a particularly good opportunity to test QCD. The process is 
quadratic in parton densities, and the final state is totally inclusive, while the large 
scale is specified and measured by the invariant mass squared Q? of the lepton pair 
which itself is not strongly interacting (so there no dangerous hadronisation effects). 
The QCD improved parton model leads directly to a prediction for the total rate as 
a function of Q?. The value of the LO cross-section is inversely proportional to the 
number of colours Nc because a quark of given colour can only annihilate with an 
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Fig. 4.18 The b production 10° 
pr distribution at the - 
Tevatron pp collider [47]. 5 
The data from CDF also 
include systematics and 
correlations. The theoretical 
curve with the uncertainty 
range is from Ref. [46] 
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antiquark of the same colour to produce a colourless lepton pair. The order a; ( Q?) 
corrections to the total rate were computed long ago and found to be particularly 
large [22, 38], when the quark densities are defined from the structure function 
F measured in DIS at q? = -Q?. The ratio oco, /oro of the corrected and the 
Born cross-sections, was called K-factor, because it is almost a constant in rapidity. 
In recent years also the NLO full calculation of the K-factor was completed, a 
very remarkable calculation [37]. The QCD predictions have been best tested for 
W and Z production at CERN SppS and Tevatron energies. Q ^ mw,z is large 
enough to make the prediction reliable (with a not too large K-factor) and the ratio 
JT = Q/ s is not too small. Recall that in lowest order xixos = Q? so that the 
parton densities are probed at x values around ./t. We have ,/t = 0.13 — 0.15 (for 
W and Z production, respectively) at v/s = 630 GeV (CERN SppS Collider) and 
A/t = 0.04 — 0.05 at the Tevatron. In this respect the prediction is more delicate at 
the LHC, where yT ~ 5.7 — 6.5 - 1073. One comparison of the experimental total 
rates at the Tevatron [48] with the QCD predictions is shown in Fig. 4.19, together 
with the expected rates at the LHC (based on the structure functions obtained in 
[23]. 

The calculation of the W/Z pr distribution has been a classic problem in 
QCD. For large pr, for example pr ^ o(my), the pr distribution can be 
reliably computed in perturbation theory, which was done up to NLO in the late 
“70’s and early '80's. A problem arises in the intermediate range Agcp << 
pr «« my, where the bulk of the data is concentrated, because terms of order 
As ( p?) log mt / p; become of order one and should included to all orders [39]. At 
order o, we have: 


1 doo B m? 
— —;, = (+4) + —log—- + —— + D(pj) (498) 
90 dp PT T + (pT)4- 
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Fig. 4.19 Data vs. theory for W and Z production at the Tevatron (4/5 = 1.8 TeV) together with 
the corresponding predictions for the LHC (./s= 1.4 TeV) [48] 


where A, B, C, D are coefficients of order œs. The “+” distribution is defined in 
complete analogy with Eq. (4.93): 


PTMAX PM X 
Í g(z)f().dz = Í i [g(z) — gO) f(z)dz (4.99) 


The content of this, at first sight mysterious, definition is that the singular “+” terms 
do not contribute to the total cross-section. In fact for the cross-section the weight 
function g(z) — 1 and we obtain: 


PTMAX 
o = olll +A) + f D(z)dz] (4.100) 
0 


The singular terms, of infrared origin, are present at the non completely inclusive 
level but disappear in the total cross-section. Arguments have been given that these 
singularities are expected to exponentiate. Explicit calculations in low order support 
the exponentiation which leads to the following expression: 


1 doo d?b 
——, = | -—exp(-ib: pr)( + A)exp S(b) (4.101) 
00 dp. 4r 
with: 
PTMAX d?kr B m2, C 
S(b) = f [expikr : b — 1][> log 5- + SI] (4.102) 
0 2x ke ki ke 


At large pr the LO perturbative expansion is recovered. At intermediate pr 
the infrared pr singularities are resummed (the Sudakov log terms, which are 
typical of vector gluons, are related to the fact that for a charged particle in 
acceleration it is impossible not to radiate, so that the amplitude for no soft 
gluon emission is exponentially suppressed). However this formula has problems 
at small pr, for example, because of the presence of as under the integral for S(b): 
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Fig. 4.20 QCD predictions for the W pr distribution compared with recent DO data at the 
Tevatron (4/s = 1.8 TeV)[49] [40] 


presumably the relevant scale is of order kh: So it must be completed by some non 
perturbative ansatz or an extrapolation into the soft region. All the formalism has 
been extended to NLO accuracy, where one starts from the perturbative expansion 
at order o2, and generalises the resummation to also include NLO terms of order 
As ( pry log nt, / p? (see, for example, [40]). The comparison with the data is very 
impressive. In Fig. 4.20 we see the pr distribution as predicted in QCD (with a 
number of variants that mainly differ in the approach to the soft region) compared 
with some recent data at the Tevatron [49]. 

A great effort is being devoted to the preparation to the LHC. Calculations 
for specific processes are being completed. A very important example is Higgs 
production via g + g — H. The amplitude is dominated by the top quark loop, as 
discussed in Chap. 3 [51]. The NLO corrections turn out to be particularly large [52], 
as seen in Fig. 4.21. Higher order corrections can be computed either in the effective 
lagrangian approach, where the heavy top is integrated away and the loop is shrunk 
down to a point [53] [the coefficient of the effective vertex is known to TM accuracy 
[54]]. or in the full theory. At the NLO the two approaches agree very well for the 
rate as a function of mg [55]. The NNLO corrections have been computed in the 
effective vertex approximation [56] (see Fig. 4.21). Beyond fixed order resummation 
of large logs were carried out [57]. Also the NLO EW contributions have been 
computed [58]. Rapidity (at NNLO) [59] and pr distributions (at NLO) [60] have 
also been evaluated. At smaller pr the large logarithms [/og(pr / m g)]" have been 
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resummed in analogy with what was done long ago for W and Z production [61]. 
For additional recent works on Higgs physics at colliders see, for example, [62]. 

The activity on event simulation also received a big boost from the LHC prepa- 
ration (see, for example, the review [50]). General algorithms for performing NLO 
calculations numerically (requiring techniques for the cancellation of singularities 
between real and virtual diagrams) have been developed (see, for example, [65]). 
The matching of matrix element calculation of rates together with the modeling of 
parton showers has been realised in packages, as for example in the MC@NLO [63] 
or POWHEG [64] based on HERWIG. The matrix element calculation, improved by 
resummation of large logs, provides the hard skeleton (with large pr branchings) 
while the parton shower is constructed by a sequence of factorized collinear 
emissions fixed by the QCD splitting functions. In addition, at low scales a model 
of hadronisation completes the simulation. The importance of all the components, 
matrix element, parton shower and hadronisation can be appreciated in simulations 
of hard events compared with the Tevatron data. 

At different places in the previous pages we have seen examples of resummation 
of large logs. This is a very important chapter of modern QCD. The resummation 
of soft gluon logs enter in different problems and the related theory is subtle. I refer 
the reader here to some recent papers where additional references can be found 
[66]. A particularly interesting related development has to do with the so called non 
global logs (see, for example, [67]). If in the measurement of an observable some 
experimental cuts are introduced, which is a very frequent case, then a number 
of large logs can arise from the corresponding breaking of inclusiveness. The 
discussion of this problem has led to rethinking the theory of final state observables. 
It is also important to mention the development of software for the automated 
implementation of resummation (see, for example, [68]). 
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Before closing this section I would like to mention some very interesting 
developments at the interface between string theory and QCD, twistor calculus. A 
precursor work was the Parke-Taylor result in 1986 [69] on the amplitudes for n 
incoming gluons with given helicities [70]. Inspired by dual models, they derived a 
compact formula for the maximum non vanishing helicity violating amplitude (with 
n—2 plus and 2 minus helicities) in terms of spinor products. Using the relation 
between strings and gauge theories in twistor space Witten developed in '03 [71] 
a formalism in terms of effective vertices and propagators that allows to compute 
all helicity amplitudes. The method, alternative to other modern techniques for the 
evaluation of Feynman diagrams [73], leads to very compact results. Since then 
rapid progress followed (for reviews, see [72]): for tree level processes powerful 
recurrence relations were established [74], the method was extended to include 
massless external fermions [75] and also external EW vector bosons [76] and Higgs 
particles [77]. The level already attained is already important for multijet events at 
the LHC. And the study of loop diagrams has been started. In summary, this road 
looks new and promising. 


4.6 Measurements of a; 


+ 


Very precise and reliable measurements of as (mz) are obtained from e* e colliders 


(in particular LEP) and from deep inelastic scattering. 


4.6.1 a; from e*e~ Colliders 


The main methods at ete~ colliders are: (a) Inclusive hadronic Z decay, Rj, On, Ol, 
Tz. (b) Inclusive hadronic t decay. (c) Event shapes and jet rates. 

As we have seen, for a quantity like R; we can write a general expression of the 
form: 


Dl(Z,t — hadrons) EW 
Rp = ———————— ~ R 1 ô ô e. 4.103 
4 l(Z,r — leptons) (Y 0ocb +ênp) + ( ) 


where REW is the electroweak-corrected Born approximation, óoc p, dnp are the 
perturbative (logarithmic) and non perturbative (power suppressed) QCD correc- 
tions. For a measurement of o, at the Z (in the following we always refer to the 
MS definition of 0/5) one can use all info from R;, rz = 30; + EF; and (f=h or 
D oy = 1201 T ¢/ m2 T2. In the past the measurement from R; was preferred 
(by itself it leads to o/; (mz) = 0.1226 + 0.0058 — 0.0036) but at LEP there is no 
reason for that. In all these quantities œs enters through T}, but the measurements 
of, say, Tz, R; and o are really independent (they are affected by entirely different 
systematics: Iz is extracted from the line shape, R; and o; are measured at the peak 
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but R; does not depend on the absolute luminosity while o; does).The most sensitive 
single quantity is o;. The combined value from the measurements at the Z (assuming 
the validity of the SM and a light Higgs mass) is [78]: 


os (mz) = 0.119 + 0.003 (4.104) 


For a relatively light Higgs (even if not as light as from the fit to EW observables) the 
final error is mainly experimental with a theoretical component from our ignorance 
of my, of higher orders in the QCD expansion [79] and also from uncertainties on 
the Bhabha luminometer (which affect o; ;) [80]. By adding all other electroweak 
precision electroweak tests (in particular mw) one similarly finds [41]: 


as (mz) = 0.1185 + 0.003 (4.105) 


We now consider the measurement of o, (mz) from t decay. R, has a number 
of advantages that, at least in part, tend to compensate for the smallness of m, — 
1.777 GeV. First, R; is maximally inclusive, more than R,+,-(s), because one also 
integrates over all values of the invariant hadronic squared mass: 


Lom ds S 2 
R, = - | — (l — —) ImIl (s) (4.106) 
T Jo mī mī 


The perturbative contribution is known at NNLO. Analyticity can be used to 
transform the integral into one on the circle at |s| = m?: 


1 ds S 5 
— (1 — —) I) (4.107) 
m: 


po ; 
2zi Ji, mz 


Also, the factor (1 — +)? is important to kill the sensitivity the region Re[s] = m2 
where the physical cut and the associated thresholds are located. Still the quoted 


result [81] looks a bit too precise: 
as (mz) = 0.345 + 0.010 (4.108) 
or 
as (mz) = 0.1215 + 0.0012 (4.109) 


This precision is obtained by taking for granted that corrections suppressed by 1/m? 
are negligible. This is because, in the massless theory, the light cone expansion is 
given by: 


ZERO < 04 > < 06> 
np = L Ru “3 H cg SEZ e... (4.110) 
m m m 


T T T 
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In fact there are no dim-2 Lorentz and gauge invariant operators. For example, g, g^ 
is not gauge invariant. In the massive theory, the ZERO is replaced by light quark 
mass-squared m°. This is still negligible if m is taken as a lagrangian mass of a 
few MeV. If on the other hand the mass were taken to be the constituent mass of 
order A gc p, this term would not be at all negligible and would substantially affect 
the result (note that o, (m;)/z ~ 0.1 ~ (0.6 GeV/m,)? and that A gc p for three 
flavours is large). For example, the PDG value and estimate of the error is [9]: 


os (mz) = 0.120 + 0.003. (4.111) 


Most people believe the optimistic version. I am not convinced that the gap is 
not filled up by ambiguities of 0(^5cp/mz) from ôpert [82]. In any case, one 
can discuss the error, but it is true and remarkable, that the central value from 
t decay, obtained at very small Q?, is in good agreement with all other precise 
determinations of a, at more typical LEP values of Q?. 

Important determinations of o,(mz) are obtained from different infrared safe 
observables related to event rates and jet shapes in ete~ annihilation. The main 
problem of these measurements is the large impact of non perturbative hadronization 
effects on the result and therefore on the theoretical error. The perturbative part is 
known at NLO. One advantage is that the same measurements can be repeated at 
different ./s values (e.g. with the same detectors at LEP1 or LEP2) allowing for a 
direct observation of the energy dependence. A typical result, from jets and event 
shapes at LEP, quoted in Ref. [83], is given by: 


as (mz) = 0.121 + 0.005. (4.112) 


Recently the rate of 4-jet events (proportional to a2) at LEP as function of yoy; has 
been used [84], for which a NLO theoretical calculation exists [85]. The quoted 
result is as(mz) = 0.1176 + 0.0022 (the actual error could be somewhat larger 
because the ambiguity from hadronisation modeling is always debatable). 


4.6.2 as from Deep Inelastic Scattering 


QCD predicts the Q? dependence of F(x, Q?) at each fixed x, not the x shape. But 
the Q? dependence is related to the x shape by the QCD evolution equations. For 
each x-bin the data allow to extract the slope of an approximately straight line in 
dlog F (x, Q?)/dlog Q?: the log slope. The Q? span and the precision of the data 
are not much sensitive to the curvature, for most x values. A single value of Agcp 
must be fitted to reproduce the collection of the log slopes. For the determination of 
as the scaling violations of non-singlet structure functions would be ideal, because 
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of the minimal impact of the choice of input parton densities. We can write the 
non-singlet evolution equations in the form: 


es (t) ‘dy FQ.) » G 
2n J, y F(x,t) Fag 


og F(x, t) = , ds (t)) (4.113) 


where P4, is the splitting function. At present NLO and NNLO corrections are 
known. It is clear from this form that, for example, the normalisation error on the 
input density drops away, and the dependence on the input is reduced to a minimum 
(indeed, only a single density appears here, while in general there are quark and 
gluon densities). Unfortunately the data on non-singlet structure functions are not 
very accurate. If we take the difference of data on protons and neutrons, Fp — Fy, 
experimental errors add up in the difference and finally are large. The F3,,, data are 
directly non-singlet but are not very precise. A determination of a, from the CCFR 
data on F3, y has led to [86]: 


o (mz) = 0.119 + 0.006 (4.114) 


A recent analysis of the same data leads to o, (mz) = 0.119 + 0.002 [87], but 
the theoretical error associated with the method and with the choice adopted for the 
scale ambiguities is not considered. A fit to non singlet structure functions in electro- 
or muon-production extracted from proton and deuterium data at the NNLO level 
was performed in Ref. [88] with the result: 


as (mz) = 0.114 + 0.002 (4.115) 


When one measures gs from scaling violations on F> from e or u beams, the data 
are abundant, the errors small but there is an increased dependence on input parton 
densities and especially a strong correlation between the result on o; and the input 
on the gluon density. There are complete and accurate derivations of a, from scaling 
violations in F5. In a well known analysis by Santiago and Yndurain [89], the data 
on protons from SLAC, BCDMS, E665 and HERA are used with NLO kernels plus 
the NNLO first few moments. The analysis is based on an original method that uses 
projections on a specially selected orthogonal basis, the Bernstein polynomials. The 
quoted result is given by: 


as (mz) = 0.1163 + 0.0014 (4.116) 
(these authors also quote o; (mz) = 0.115 + 0.006 for F3 data in v N scattering). A 


different analysis by Alekhin [90] of existing data off proton and deuterium targets 
with NNLO kernels and a more conventional method leads to 


os (mz) = 0.114 + 0.002 (4.117) 
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In both analyses the dominant error is theoretical and, in my opinion, should be 
somewhat larger than quoted. An interesting perspective on theoretical errors can 
be obtained by comparing analyses with different methods. We add the following 
examples. From truncated moments (but with a limited set of proton data and NLO 
kernels) [91]: o; (mz) = 0.122+0.006, from Nachtmann moments (which take into 
account some higher twist terms) and proton data [92]: v, (mz) = 0.1188 3: 0.0017. 
A combination of measurements at HERA by H1 and Zeus, also including final state 
jet observables, leads to o, (mz) = 0.1186 + 0.0051 [93], most of the error being 
theoretical. Finally, to quote a number that appears to me as a good summary of 
the situation of o/; (mz) from DIS one can take the result from a NNLO analysis of 
available data by the MRST group [94] as quoted by Particle Data Group, W.-M. 
Yao et al. [9]: 


os (mz) = 0.1167 + 0.0036 (4.118) 


If we compare these results on o; from DIS with the findings at the Z, given by 
Eq. (4.105), we see that the agreement is good, with the value of o; from the most 
precise DIS measurements a bit on the low side with respect to e* e^. 


4.6.3 Summary on ds 


There are a number of other determinations of o; which are important because they 
arise from qualitatively different observables and methods. For example [9, 83], 
some are obtained from the Bjorken sum rule and the scaling violations in polarized 
DIS, from Y decays, from the 4-jet rate in e* e^. A special mention deserves the 
“measurement” of o; from lattice QCD [95]. A number of hadronic observables, 
in particular Y’ — Y splitting, pion and kaon decay constants, the B; mass and the 
Q baryon mass are used to fix the lattice spacing and to accurately tune the QCD 
simulation. The value of o; is then obtained by computing non perturbatively a 
number of quantities related to Wilson loops that can also be given in perturbation 
theory. The result is then evolved with state of the art beta functions to obtain 
ads(mz) = 0.1170 + 0.0012. This result is interesting for its really special nature 
but it is not clear that the systematics due to the lattice technology is as small as 
claimed. 

Summarising: there is very good agreement among many different measurements 
of as. In Fig. 4.22 [83], a compilation of the data is reported with each measurement 
plotted at the scale of the experiment, which shows the consistency of the measure- 
ments and the running of œs. This is a very convincing, quantitative test of QCD. 
If I take the values of o/; (Hz) from precision electroweak data, Eq. (4.105), from c 
decay with the central value as in Eq. (4.109) but the larger error as in Eq. (4.111), 
from jets in et e^, Eq. (4.112), and from DIS, Eq. (4.118), the average is : 


as (mz) = 0.119 + 0.002 (4.119) 
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For comparison, the average value quoted by PDG 2006 is æs(mz) = 0.1176 + 
0.0020 while Ref. [83] gives o; (mz) = 0.1189 + 0.0010. 
The value of A (for n y = 5) which corresponds to Eq. (4.119) is: 


As = 221425 MeV (4.120) 


A is the scale of mass that finally appears in massless QCD. It is the scale where 
as(A) is of order one. Hadron masses are determined by A. Actually the o mass 
or the nucleon mass receive little contribution from the quark masses (the case of 
pseudoscalar mesons is special, as they are the pseudo Goldstone bosons of broken 
chiral invariance). Hadron masses would be almost the same in massless QCD. 


4.7 Conclusion 


We have seen that perturbative QCD based on asymptotic freedom offers a rich 
variety of tests and we have described some examples in detail. QCD tests are not 
as precise as for the electroweak sector. But the number and diversity of such tests 
has established a very firm experimental foundation for QCD as a theory of strong 
interactions. The field of QCD appears as one of great maturity but also of robust 
vitality with many rich branches and plenty of new blossoms. The physics content of 
QCD is very large and our knowledge, especially in the non perturbative domain, is 
still very limited but progress both from experiment (Tevatron, RHIC, LHC...... ) 
and from theory is continuing at a healthy rate. And all the QCD predictions that 
we were able to formulate and to test appear to be in very good agreement with 
experiment. 
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Chapter 5 ff) 
QCD on the Lattice ies 


Hartmut Wittig 


5.1 Introduction and Outline 


Since Wilson's seminal papers of the mid-1970s, the lattice approach to Quantum 
Chromodynamics has become increasingly important for the study of the strong 
interaction at low energies, and has now turned into a mature and established 
technique. In spite of the fact that the lattice formulation of Quantum Field Theory 
has been applied to virtually all fundamental interactions, it is appropriate to discuss 
this topic in a chapter devoted to QCD, since by far the largest part of activity is 
focused on the strong interaction. Lattice QCD is, in fact, the only known method 
which allows ab initio investigations of hadronic properties, starting from the QCD 
Lagrangian formulated in terms of quarks and gluons. 


5.1.1 Historical Perspective 


In order to illustrate the wide range of applications of the lattice formulation, we 
give a brief historical account below. 

First applications of the lattice approach in the late 1970s employed analytic 
techniques, predominantly the strong coupling expansion, in order to investigate 
colour confinement and also the spectrum of glueballs. While these attempts gave 
valuable insights, it soon became clear that in the case of non-Abelian gauge theories 
such expansions were not sufficient to produce quantitative results. 
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First numerical investigations via Monte Carlo simulations, focusing in particular 
on the confinement mechanism in pure Yang-Mills theory, were carried out around 
1980. The following years saw already several valiant attempts to study QCD 
numerically, yet it was realized that the available computer power was grossly 
inadequate to incorporate the effects of dynamical quarks. It was then that the so- 
called “quenched approximation" of QCD was proposed as a first step to solving 
full QCD numerically. This approximation rests on the ad hoc assumption that 
the dominant non-perturbative effects are mediated by the gluon field. Hadronic 
observables can then be computed on a pure gauge background with far less 
numerical effort compared to the real situation where quarks have a feedback on the 
gluon field. The main focus of activity during the 1980s was on bosonic theories: 
numerical simulations were used to compute the glueball spectrum in pure Yang— 
Mills theory. Another important result during this period concerned $^-theory and 
the implications of its supposed “triviality” for the Higgs-Yukawa sector of the 
Standard Model. Using a combination of analytic and numerical techniques, the 
triviality of 9^ theory could be rigorously established. 

Except for a brief spell of activity around the turn of the decade to simulate 
QCD with dynamical fermions, most projects in the 1990s were devoted to explore 
quenched QCD. Having recognized that the available computers and the efficiency 
of known algorithms were by far not sufficient to perform "realistic" simulations 
of QCD with controlled errors, lattice physicists resorted to exploring the quenched 
approximation and its limitations for a number of phenomenologically interesting 
quantities. Although the systematic error that arises by neglecting dynamical quarks 
could not be quantified reliably, many important quantities, such as quark and 
hadron masses, the strong coupling constant and weak hadronic matrix elements, 
were computed for the first time. One of the icons of that period was surely a 
plot of the masses of the lightest hadrons in the continuum limit of quenched 
QCD, produced by the CP-PACS Collaboration: their results indicated that the 
quenched approximation works surprisingly well (at least for these quantities), since 
the computed spectrum agreed with experimental determinations at the level of 
1096. Simultaneously, a number of sophisticated techniques have been developed 
during the 1990s, thereby helping to control systematic effects, mainly pertaining 
to the influence of lattice artefacts, as well as the renormalization of local operators 
in the lattice regularized theory and their relation to continuum schemes such as 
MS. Perhaps the most significant development at the end of the 1990s was the 
clarification of the issue of chiral symmetry and lattice regularization. Following 
this work it is now understood under which conditions the lattice formulation is 
compatible with chiral symmetry. The importance of this development extends far 
beyond QCD and implies new prospects for the non-perturbative study of chiral 
gauge theories. 

Since 2000 the focus has decidedly shifted from the quenched approximation 
to serious attempts to simulate QCD with dynamical quarks, thereby tackling the 
biggest remaining systematic uncertainty. Progress in this area has not just been 
determined by the vast increase in computer power since the very first Monte Carlo 
simulations, but rather by the development of new algorithmic ideas, combined 
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with the use of alternative discretizations that are numerically more efficient. At 
the time of writing this contribution (2007), the whole field is actually in a state 
of transition: although the quenched approximation is being abandoned, the latest 
results from simulations with dynamical quarks have not yet reached the same level 
of accuracy in regard to controlling systematic errors due to lattice artefacts and 
effects from renormalization, as compared to earlier quenched calculations. It can 
thus be expected that many of the results discussed later in this chapter will soon 
be superseded by more accurate numbers. In turn, the quenched approximation will 
be completely obsolete in a few years time, except perhaps to test new ideas or for 
exploratory studies of more complex quantities. 


5.1.2 Outline 


We begin with an introduction of the basic concepts of the lattice formulation of 
QCD. This shall include the field theoretical foundations, discretizations of the QCD 
Lagrangian, as well as simulation algorithms and other technical aspects related to 
the actual calculation of physical observables from suitable correlation functions. 
The following sections deal with various applications. Lattice calculations of 
the hadron spectrum are described in Sect.5.3. Section 5.4 is devoted to lattice 
investigations of the confinement phenomenon. Determinations of the fundamental 
parameters of QCD, namely the strong coupling constant and quark masses are 
a major focus of this article, and are presented in Sect.5.5. Another important 
property of QCD, namely the spontaneously broken chiral symmetry, is discussed in 
some detail in Sect. 5.6, which also includes a brief introduction into analytical non- 
perturbative approaches to the strong interaction, based on effective field theories. 
Lattice calculations of weak hadronic matrix elements, which serve to pin down the 
elements of the Cabibbo-Kobayashi-Maskawa matrix, are covered in Sect. 5.7. We 
end this contribution with a few concluding remarks. 

In addition to the topics listed above, lattice simulations of QCD have also 
made important contributions to the determination of the phase structure of QCD, 
including results for the critical temperature of the deconfinement phase transition. 
Nevertheless, in this chapter we restrict the discussion to QCD at zero temperature 
and refer the reader to other parts of this volume. 


5.0 The Lattice Approach to QCD 


The essential features of the lattice formulation can be summarized by the following 
statement: 


Lattice QCD is the non-perturbative approach to the gauge theory of the strong interaction 
through regularized, Euclidean functional integrals. The regularization is based on a 
discretization of the QCD action which preserves gauge invariance at all stages. 
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This definition includes all basic ingredients: starting from the functional integral 
itself avoids any particular reference to perturbation theory. This is what we mean 
when we call lattice QCD an ab initio method. The Euclidean formulation, which is 
obtained by rotating to imaginary time, reveals the close relation between Quantum 
Field Theory and Statistical Mechanics. In particular, the Euclidean functional 
integral is equivalentto the partition function of the corresponding statistical system. 
This equivalence is particularly transparent if the field theory is formulated on a 
discrete space-time lattice. Via this relation, the whole toolkit of condensed matter 
physics, including high-temperature expansions, and, perhaps most importantly, 
Monte Carlo simulations, are at the disposal of the field theorist. 

Many of the basic concepts introduced in this section are discussed in several 
common textbooks on the subject [1—4], which can be consulted for further details. 


5.2.1 Euclidean Quantization 


The generic steps in the Euclidean quantization procedure of a lattice field theory 
are the following: 


1. Define the classical, Euclidean field theory in the continuum; 

2. Discretize the corresponding Lagrangian; 

3. Quantize the theory by defining the functional integral; 

4. Determine the particle spectrum from Euclidean correlation functions. 


We shall now illustrate this procedure for a simple example, namely the theory for 
a neutral scalar field. 


Step 1 Consider a real, classical field 9 (x), with x = (x9, x! , x?, x?), whose time 
variable x is obtained by analytically continuing t to —ix?. The Euclidean action 
Selo] is defined as 


1 ð 
Sel] = J dfx E ve». d, m (5.1) 
Xu 


V$) = I + tga). (5.2) 


Step 2 In order to discretize the theory, a hyper-cubic lattice, Ag, is introduced as 
the set of discrete space-time points, i.e. 


Àp= [x e R*|x°/a=1,..., Nc sac od sg edis 


T = Na, L = Nsa. (5.3) 
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Thus, any space-time point is an integer multiple of the lattice spacing a. The total 
number of lattice sites is Nt x NÈ , While the physical space-time volume is T x L?. 
The discretized action is then given by 


1 1 Xr 
Sele] =a* Y^ [54i 6d o tmo) d (5.4) 


X€AEB 


where the lattice derivatives can be defined as 
1 
d G(x) := — (p(x + aj) — (x) “forward” derivative, (5.5) 
a 
1 
dj P(x) = (¢ (x) —o(x —a [y "backward" derivative. ^ (5.6) 
a 


Here and below ji denotes a unit vector in direction of u. Via a Fourier transform, 
the Euclidean lattice Ag is related to the dual lattice, AZ, defined by 


27 27 
4 0 
No |pen mo En, pj = Zn’ } 
Ni Ni Ni ; Ng Ng Ng 
0 t t t i S i 
=-—,-—+],...,—-l, J = ——, —-— + 1,...,— - 1. (57 
n 2 27 2 n 2 aT 2 (5.7) 


This not only implies that the momenta po and p; are quantized in units of 27 /T 
and 27r/L, respectively, but also that a momentum cutoff has been introduced, since 


(5.8) 


Bie < 
— € pu € 
a H 


FEE 


As we shall see below, this way of introducing a momentum cutoff can be extended 
to gauge theories in such a way that gauge invariance is respected. An important 
point to realize is that the lattice action is not unique: it is only required that the 
discretized expression for Sp reproduces the continuum result as the lattice spacing 
a is taken to zero. 


Step 3 The theory is quantized via the Euclidean functional integral 


ys f Digle “1, Dial = J] déco. (5.9) 


X€ AE 


Here one sees explicitly that the discretization procedure has given a mathematical 
meaning to the integration measure, which reduces to that of an ordinary, multiple- 
dimensional integration. 
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One can now define Euclidean correlation functions of local fields through 


1 
(60): :$69) — 77 f Diplo (x1) Q (Xn), (5.10) 


In the continuum limit, these correlation functions approach the Schwinger func- 
tions, which encode the physical information about the spectrum within the 
Euclidean formulation. Osterwalder and Schrader [5] have laid down the general 
criteria which must be satisfied such that the information in Minkowskian space- 
time can be reconstructed from the Schwinger functions. 


Step 4 The particle spectrum is extracted from the exponential fall-off of the 
Euclidean two-point correlation function. To this end, one must define the Euclidean 
time evolution operator. The transfer matrix T describes time propagation by a finite 
Euclidean time interval a. The functional integral can be expressed in terms of the 
transfer matrix as 


Ze = TrTM, (5.11) 


where the trace is taken over the basis |o) of the Hilbert space of physical states. 
In order to obtain expressions which are more reminiscent of those in Minkowski 
space-time, one can define a Hamiltonian Hg by 


T =: ek, (5.12) 
If |o) denotes an eigenstate of the transfer matrix with eigenvalue àg, i.e. 
Ta) = iala) = e ^e |o), (5.13) 


then one can work out the spectral decomposition of the two-point correlation 
function, viz. 


1 
(6092) = 7- i} Diplo G6 Q)e- 141 (5.14) 


= e aE 00-a] 6 (0, y)|0)(0|(0, x)|a). (5.15) 


Here, the quantity (Ey — Eo) is the so-called mass gap, i.e. the energy of the state 
|æ) above the vacuum. For large Euclidean time separations (xo — yo) the lowest 
state dominates the two-point function, i.e. all higher states die out exponentially. 
The spectral decomposition of the two-point function forms the basis for numerical 
simulations of lattice field theories, as the mass (or energy) of a given state is given 
by the dominant exponential fall-off at large Euclidean times (see Sect. 5.2.3). 
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5.2.2 Lattice Actions for QCD 


Our goal now is to find a lattice transcription of the Euclidean QCD action in the 
continuum, i.e. 


4 1 7 
Socp = | d^x -zg Ge Faw) + XO vy (yDutmye) vert. 616) 
0 f-ud,s... 


where go denotes the gauge coupling, and our conventions are chosen such that the 
covariant derivative is defined through 


Dy = 04 + Ay, (5.17) 
while the field tensor reads 


Fav = ðp Av — DAL ApA A =A. (5.18) 


t 
u 
Before attempting to write down a discretized version, we must first elucidate the 
notion of a lattice gauge field in a non-Abelian theory. In fact, in this case it turns 
out that the gauge potential A, must be abandoned when the theory is discretized. 
The reason is that the familiar non-Abelian transformation law, 1.e. 


Au (x) > gG)AuGOgG) | + g(x)du(x)g(x)!, g(x) € SUB), (5.19) 


no longer holds exactly when 9, is replaced by its discrete counterpart d, of 
Eq. (5.5). Strict gauge invariance at the level of the regularized theory cannot be 
maintained in this fashion. 

The definition of a lattice gauge field relies on the concept of the parallel 
transporter. If a quark moves in the presence of a background gauge field from y 
to x, it picks up a non-Abelian phase factor, given by 


U(x, y) = P.O. exp Hf dz, Ay} l (5.20) 
D 


where “P.O.” denotes path ordering, as a consequence of the non-Abelian nature of 
the gauge field. By contrast to the gauge potential A,,, which is an element of the Lie 
algebra of SU(3), the parallel transporter U (x, y) is an element of the gauge group 
itself. On the lattice, the parallel transporter between neighbouring lattice sites x 
and x + aÅ is called link variable: 


U(x,x+afi)=Uy(x), U(x +afi,x) 2 UG, x ad) = Ux). 
(5.21) 
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A consistent and manifestly gauge invariant discretization of QCD is obtained by 
identifying the gauge degrees of freedom with the link variables U,,(x), which 
transform under the gauge group as 


Up) > g(x) UG) g(x af), g(x), g(x af) € SUG). (5.22) 


The connection with the gauge potential A„ (x) is somewhat subtle: if U, (x) 
denotes a given link variable in the discretized theory, it can be used to define a 
vector field A,, (x) as an element of the Lie algebra of SU(3) via 


gau? = U(x). (5.23) 


In turn, if Aj, is a given gauge potential in the continuum theory, one can always 
find a link variable which approximates A7, up to cutoff effects. 

Now we turn to the problem of defining a discretized version of the Yang—Mills 
action. To this end we define the plaquette P y(x) as the product of link variables 
around an elementary square of the lattice: 


Pav (x) = Uy (x) Uy (x + af)Uy (x + aô)! U(x)! (5.24) 


A graphical representation is shown in Fig. 5.1. Using the transformation property 
in Eq. (5.22), it is easy to convince oneself that this object is manifestly gauge 
invariant. Moreover, it serves to define the simplest discretization of the Yang—Mills 
action, the Wilson plaquette action [6] 


SadUl]ep v * (1 = Re Tr Puy). (5.25) 


X€Ag IL«v 


It has become a standard textbook exercise to verify that for small lattice spacings 


1 
SolU] — —7 dx Tr (Fav Fav) + O(a), (5.26) 
80 


provided that one relates the parameter £ to the bare gauge coupling via 6 = 6/ gà in 
Eq. (5.25). We have remarked already that the discretization of a field theory is not 


Fig. 5.1 Graphical ytav X-+0 peta v 
representation of the 


plaquette P,,, (x) in the 

(u, v)-plane. The arrow 
between sites x + afi and x 
denotes the link variable 
U, (x) 


x X+0 u 
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unique, and hence one is free to add further gauge invariant terms to the plaquette 
action which formally vanish as a — 0, but which produce a discretization with 
an accelerated rate of convergence to the continuum limit. The most widely chosen 
alternatives are the Symanzik [7] and Iwasaki [8] actions. 

Quark and antiquark fields, w(x) and W(x), are associated with the lattice sites 
and transform under the gauge group as 


W(x) > goya), Pa) > paga. (5.27) 


Using the transformation property of the link variables, it is straightforward to write 
down a discretized version of the covariant derivative, i.e. 


Vu (x) = — (U GV G + aft) — VG) 


Viv): 


al= Site 


(vœ — UG — aj)" — ai). (5.28) 


where V,, and Via denote the “forward” and “backward” derivatives, respectively. 
Finally, we note that in Euclidean space-time, the Dirac matrices can be defined to 
satisfy {Yu wh = 2óyv. 

Before we attempt to construct the fermionic part of the action of lattice QCD, it 
is useful to identify the basic properties that the discretized, massless Dirac operator, 
D, should satisfy: 


(a) D is local; 

(b) D(p) = ivy pu + O(ap”); 
(c) D(p) is invertible for p 4 0; 
(d) ys D + Dys = 0. 


Locality, i.e. the absence of long-ranged interactions, is a basic property of any 
quantum field theory describing elementary particles. Property (b) implies that 
the correct continuum behaviour of the quark-gluon interaction is reproduced. 
Furthermore, condition (c) ensures that the correct fermion spectrum is obtained: 
fermion masses are associated with poles of (D(Cp))-!, which, in the continuum 
theory, only occur at vanishing four-momentum. Finally, property (d) ensures that 
the massless theory respects chiral symmetry. 

Using the definition of the covariant derivative and the conventions for the Dirac 
matrices in Euclidean space-time, we can now write down the simplest discretized 
version of the massless lattice Dirac operator: 


1 
Dgisc = 3Yu(Vu + Vi. (5.29) 
It turns out, however, that this “naive” discretization violates condition (c) and 


therefore produces spurious fermionic degrees of freedom. This is the so-called 
fermion doubling problem, which is most easily explained by considering Dgi,c in 
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momentum space for the free theory. The Fourier transform yields 


- "x 
Daisc(p) = iyu = sin(apy) = iyu Pu + O(a’). (5.30) 


The discretization procedure has thus replaced p, by a sine function. While 
the Taylor expansion guarantees that condition (b) is satisfied, the occurrence of 
sin(ap,,) implies that Daisc( p) vanishes not only at p, = 0, but also at zr/a for 
u = 0,...,3 in the permitted range of momenta, thereby violating condition (c). 
The massless propagator {Daisc( p)}~! therefore has 2^ = 16 poles, and thus there 
is a 16-fold degeneracy of the fermion spectrum. 

As we shall see below, the fermion doubling problem is closely linked with the 
issue of chiral symmetry on the lattice. For now we simply list the various methods 
that have been devised to address fermion doubling. Historically the first was due to 
Wilson (“Wilson fermions") [6]. Here, the degeneracy is lifted completely, but the 
price to pay is the explicit breaking of chiral symmetry at the level of the regularized 
theory. Another method, due to Kogut and Susskind (“staggered fermions”) [9], 
is based on the idea of spreading individual spinor components over the corners 
of an elementary hypercube of the lattice. Although the degeneracy is only lifted 
partially (from 16 to 4), this formulation has the advantage of leaving a subgroup 
of chiral symmetry unbroken. More recent developments include the use of so- 
called “domain wall” [10, 11] or “overlap” [12] fermions. These formulations leave 
chiral symmetry unbroken in principle, and also succeed in lifting the degeneracy 
completely. Finally, there are the so-called "perfect" actions [13], which are based 
on a renormalization group approach and which are in principle completely free 
of lattice artefacts. An exact realization of the perfect action which can be used in 
simulations is, however, difficult to obtain. In practice, one typically uses a so-called 
truncated fixed point action. Domain wall and overlap fermions, as well as perfect 
actions are particular realizations of a class of discretizations dubbed “Ginsparg- 
Wilson fermions". They have the remarkable feature that chiral symmetry is 
preserved, while the fermion doubling problem is completely avoided. We shall 
come back to this issue in more detail below. 

For now we turn specifically to Wilson's treatment of the fermion doubling 
problem. It exploits the fact that the discretization is not unique. Thus, one can add 
a term to Daisc, which formally vanishes as a — 0, but which pushes the masses of 
the unwanted doubler states to the cutoff scale at any non-zero value of the lattice 
spacing. Explicitly, the massless Wilson-Dirac operator Dw reads 


Dy = 3ya(Vy + VE) + arVEV i, (5.31) 


where r is the so-called Wilson parameter, which is usually set to one. The Fourier 
transform of D, for a trivial gauge field reads 


~ PE 2r . a 
Dy(p) = iY sin(apy) + = sin? (=) à (5.32) 
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which explicitly demonstrates (for the free theory, at least) that the poles at py, = 
x /a receive additional contributions proportional to r/a, which is of order of the 
cutoff for r = O(1). Although this procedure leads to a complete lifting of the 
degeneracy,! it has a number of unwanted features: first, it should be noted that the 
Wilson fermion action differs from the classical action in the continuum by terms 
of order a, as a result of adding the counterterm proportional to r. By contrast, the 
leading discretization effects of the Wilson plaquette action for Yang-Mills theory 
are only O(a’). The Wilson fermion formulation will thus have a reduced rate of 
convergence towards the continuum limit. Secondly, the addition of the Wilson term 
results in an explicit breaking of chiral symmetry, since the massless theory is no 
longer invariant under global axial rotations, such as 


v(x) e*5y(x) ya) > He”, (5.33) 


which implies that property (d) is violated. While the rate of convergence to the 
continuum limit can be accelerated by employing what is known as “O(a) improve- 
ment” (see below), the explicit breaking of chiral symmetry cannot be cured within 
the Wilson theory. Thus, quantities like the quark condensate, which arises from 
the spontaneous breaking of chiral symmetry, cannot be studied in a conceptually 
“clean” manner using Wilson fermions. A detailed discussion how this can be 
achieved with the help of a more sophisticated fermionic discretization (“Ginsparg- 
Wilson fermions’) is presented in Sect. 5.6. However, for most applications of lattice 
QCD, explicit chiral symmetry breaking is merely an inconvenience, but no serious 
obstacle. 

We have already remarked when discussing the discretized Yang—Mills part of 
the QCD action that the non-uniqueness of the discretization opens the possibility 
to construct lattice actions with an accelerated rate of convergence towards the 
continuum limit. A systematic way how to do this is the so-called Symanzik 
improvement programme [14], in which lattice artefacts can be removed order by 
order in the lattice spacing. In a nutshell, the improvement programme amounts to 
extending the renormalization procedure of a field theory to the level of irrelevant 
operators, i.e. operators that formally vanish as a — 0. In this sense one adds 
suitable counterterms, which for any non-zero value of a produce a cancellation 
of the cutoff effects at a given order, provided that their coefficients are tuned 
appropriately. For QCD with Wilson fermions, Sheikholeslami and Wohlert [15] 
have shown that the Symanzik improvement programme to lowest order is realized 
by adding one O(a) counterterm to the Wilson-Dirac operator Dy. The resulting 
expression in the massless case reads 


ia ES 
Dsw = Dy + 4 CswOuv Fav; (5.34) 


'That the degeneracy is indeed completely lifted in the presence of a non-trivial gauge field can be 
verified in numerical simulations. 
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Fig. 5.2 Four plaquettes that 
must be summed over to yield 
the quantity Q,,, (x) in the 
lattice definition of the field 
strength tensor. The site x is 
at the center of the “clover” 
leaf 


where oy) = [Yu yy], and F, 1» is a lattice transcription of the gluon field strength 
tensor Fy. A salable representation of F, uv in terms of plaquette variables is given 
by 


Fua) = zz (Ow - Qv» Q)); (5.35) 


where Q,,(x) is the sum of the four plaquettes emanating from the site x, as 
depicted in Fig.5.2. The object Q,,,(x) is aptly called “clover” leaf. In order 
to remove all lattice artefacts of order a in hadron masses, the improvement 
coefficient c, must be fixed by imposing a suitable improvement condition. 
Without going into details here, we note that it is possible to find such a condition, 
which can also be evaluated at the non-perturbative level [16, 17]. The resulting, 
non-perturbatively O(a) improved Wilson action can then be used to compute, say, 
hadron masses whose values differ from the continuum result by terms of only 
O(a?). 

The Wilson-Dirac operator for a quark with bare mass mo is simply (Dy + mo). 
However, the form of the Wilson fermion action, BY [U, y, V], which is found in 
the literature is usually expressed in terms of the “hopping parameter" « rather than 
mo. By rescaling the fermion fields according to 


W(x) > 2k W(x), VG) > vo) V2«, (5.36) 
one obtains 


SPW. V, V] 9 a* 35 VGO(ODS + mo) v(x) 


X€ Ag 


3 qp. 
-— à - - - 7 
=a S | KY 7 [yoo VuU) + ap) 


X€Ag u=0 


VG + afr + YU pa) VG] 


+ iow}. (5.37) 
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The hopping parameter « is related to the bare mass mpg via 


1 


k= PTEE (5.38) 


while the dimensionless parameter r is usually set to one. Taken together with 
the plaquette action of Eq. (5.25), the Wilson action for QCD is thus conveniently 
parameterized in terms of the bare parameters (£, x), with B = 6/ gi and « as above, 
instead of the bare gauge coupling and quark mass (go, mo). 

Another consequence of adding the Wilson term to the naive lattice action is the 
resulting additive renormalization of the quark mass. In other words, the point where 
the quark mass vanishes is a priori unknown. The value that must be subtracted is 
called the critical quark mass, which corresponds to the critical value of the hopping 
parameter, Kc. The bare subtracted quark mass is then given by 


1/1 1 
m= zz — -) (5.39) 
2a NK Ke 


From Eq. (5.38) one easily infers that the critical value of « in the free theory occurs 
at 


, r — 1, (5.40) 


while for non-zero go the value of ke must be determined, for instance, by adjusting 
« to the point where the pion mass vanishes. 

We now turn to discussing one alternative to using Wilson's solution to the 
fermion doubling problem, namely the so-called “staggered” (or Kogut-Susskind) 
fermions. One might think that the doubling problem arises since there are too 
many fermion degrees of freedom in the discretized theory, if one associates 
a four-component Dirac spinor with each individual lattice site. Pictorially, the 
main idea of Kogut and Susskind was to “thin out” the degrees of freedom by 
distributing single spinor components over different lattice sites. In their particular 
formulation, the 16 corners of a four-dimensional hypercube serve to accommodate 
the individual components of four Dirac spinors. Therefore, if these hypercubes 
are regarded as the main building blocks for the fermionic discretization, rather 
than the lattice sites themselves, this procedure will result in a partial lifting of 
the degeneracy from 16 fermion species down to four. It is clear, though, that a 
simple distribution of spinor components is not sufficient to define the action, since 
the Dirac matrices mix different spinor components. Thus, the staggered fermion 
action is only obtained after performing a diagonalization in spinor space, which 
then decouples the individual components. 

Rather than describing the details of this procedure, which can be found in most 
textbooks, we simply state the result. Starting from the usual four-component spinor 
and performing a spin-diagonalization, the lattice action for staggered fermions with 
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bare mass mo coupled to the gauge field is derived as 


4 
sey, Z. xl =a D 2 [mozo 


x€Ag al 


1 3 
+ 2 ns (0t (00, 00486 + aji) - Za + aUa], (5.41) 
= 


where x, denotes a one-component Grassmann variables. The spin-diagonalization 
has thus replaced the Dirac matrices y,, by real, position-dependent phase factors 
ny (x), which are given by 


no(x) = 1, nj(x)= (—1)ot--nj-i nj — xj/a. (5.42) 


At the level of the classical action, the spinor components are completely decoupled, 
and the action is decomposed into four identical pieces. In order to occupy all 16 
corners of a four-dimensional hypercube with one-component Grassmann variables, 
one needs four Dirac spinors, each of which contributes a term like Eq. (5.41) to the 
overall action. This produces the fourfold degeneracy of staggered fermions, with 
the remnant doubler states being referred to as “tastes”, in order to distinguish them 
from physical flavours. The formulation using the one-component fields within a 
hypercube can be re-expressed in terms of the spin-taste basis [18], from which one 
can infer directly that the taste symmetry is broken. However, one axial generator of 
the taste symmetry remains unbroken. The fermion mass in the staggered approach 
is therefore protected against any additive renormalization through the associated 
global axial U(1) symmetry, unlike the case of the Wilson action. While the various 
tastes decouple in the continuum limit, non-vanishing interactions between the tastes 
at O(a’) in the lattice spacing are induced, leading to large lattice artefacts. The 
Symanzik improvement programme can be employed to reduce these taste-changing 
interactions [19], and the resulting “improved staggered fermions” (the so-called 
“Asqtad”-action being one particular example [20]) have been widely used in a 
series of simulations. 

For a long time lattice physicists have struggled to find a fermionic discretization 
which would both solve the doubling problem and be compatible with chiral 
symmetry. In fact, physicists grew increasingly doubtful that this could be achieved, 
following the proof of a “No-Go theorem” by Nielsen and Ninomiya [21], which 
stated that the conditions (a)-(d) mentioned above could not be satisfied simul- 
taneously. Since one does not want to give up locality and property (b), this 
would imply that either (c) or (d) must be violated. Indeed, the Wilson and 
staggered discretizations seem to confirm this expectation: while the Wilson fermion 
action removes all doublers, it breaks chiral symmetry, leading to an additive 
renormalization of the quark mass, as well as several other consequences. By 
contrast, the staggered formulation preserves a U(1) subgroup of chiral symmetry at 
the price of only partially removing the spurious degrees of freedom. 
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A way to circumvent the Nielsen—Ninomiya theorem was already pointed out by 
Ginsparg and Wilson in 1982 [22], when they suggested to relax condition (d) in 
favour of 


ysD + Dys = aDysD. (5.43) 


However, it was not before 1997 that this condition—now commonly referred to 
as the Ginsparg-Wilson relation—was confronted with a non-trivial solution. It was 
shown [23] that the so-called “perfect action” constructed from a renormalization 
group approach satisfied equation (5.43). It was also realized that any lattice Dirac 
operator, which is a solution to the Ginsparg-Wilson relation, also satisfies the 
Atiyah-Singer index theorem, i.e. 


(ys, D =aDysD ©  index(D) = a? >, iTr (ysD) 2n-—n,, (544) 


X€Ag 


such that the operator D exhibits |n_ — n+| exact chiral zero modes. Finally, it 
was shown [24] that the Ginsparg-Wilson relation implies an exact symmetry of the 
associated action, with infinitesimal variations proportional to 


ôy —ys(1—aD)y, | 8j = vys. (5.45) 


Moreover, this symmetry reproduces the correct chiral anomaly in the flavour singlet 
case, and therefore all the hallmarks of the correct chiral behaviour are present in 
the lattice theory: chiral zero modes, an exact index theorem and the chiral anomaly 
derived from the Ward identities associated with the exact symmetry. 

Another line in the development of lattice fermion actions that preserve chiral 
symmetry goes back to Kaplan's domain wall fermion approach [10], which was 
subsequently applied to QCD by Furman and Shamir [11]. Without going into detail, 
we state that the basic idea is to introduce an extra, fifth dimension and to couple 
the fermions to a mass defect (the so-called “domain wall height") in that extra 
dimension. To make this more explicit, let x, y denote the coordinates in the four- 
dimensional bulk, and s, t the coordinates in the 5th dimension, which has finite 
length N5. The gauge fields are trivial in the 5th direction, and the Dirac operator 
then has the general structure 


Daws(x, 5; y, t) = D! (x, y)ós --8(x — y)DL (5.46) 


where D! (x, y) is the usual Wilson-Dirac operator with a negative mass term, — M, 
which represents the domain wall height. The operator D} couples fermions in the 
5th dimension and contains the physical bare quark mass mo. It can then be shown 
that for mo = O and in the limit Ns — oo there are no fermion doublers and, more 
importantly, chiral modes of opposite chirality are trapped in the four-dimensional 
domain walls at s — 1, Ns. 
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However, in a real lattice simulation of domain wall fermions, one has to work 
with a finite value of Ns, so that the decoupling of chiral modes is not exact. 
One expects, though, an exponential suppression of the remnant chiral symmetry 
breaking effects, and this has been confirmed in several simulations. Furthermore, 
the rate of suppression may be accelerated by optimizing the choice of lattice action 
for the gauge fields. Hence, the domain wall formulation of QCD offers a method 
to realize almost exact chiral symmetry at non-zero lattice spacing at the expense of 
simulating a five-dimensional theory. 

Another operator which correctly reproduces the chiral properties of QCD at 
non-zero lattice spacing was constructed by Neuberger [12]. Its definition is 


py = z(1- =), A=1+s-aDy, @= (5.47) 


where Dy is the massless Wilson-Dirac operator, and |s| < 1 is a tunable parameter. 
By defining Q = —ysA, one can rewrite Eq. (5.47) as 


1 
Dy = = (1 + yssign(Q)) . (5.48) 


The Neuberger-Dirac operator Dy removes all doublers from the spectrum, and can 
easily be shown to satisfy the Ginsparg- Wilson relation [12]. The occurrence of an 
inverse square root in Dy raises two issues. First, it is a priori not clear whether or 
not Dy is local. Second, the application of Dy in a computer program is potentially 
very costly, since the sign-function of the matrix Q must be implemented using, for 
instance, a polynomial approximation. 

In order to qualify as a viable discretization of the quark action, “strict” locality, 
meaning that only fields in a local neighbourhood of a given lattice site are coupled, 
is not actually required. If D(x, y) denotes a generic lattice Dirac operator which 
couples fields at sites x and y, then a sufficient condition for locality of D is the 
exponential suppression of non-local interactions, i.e. 


| D(x, y)l x e Y PV, (5.49) 


where |x — y| is the distance between sites and || - || denotes a suitably defined 
matrix norm. In Ref. [25] it was shown that the Neuberger-Dirac operator Dy is 
local in the sense of Eq. (5.49), provided that the lattice spacing in physical units? 
is not larger than about 0.13 fm. As far as the issue of numerical efficiency is 
concerned, we note that the most widely used approximations of sign(Q) with good 
convergence properties include Chebysheff or Zolotarev polynomials, as well as 
rational fractions. 


?So far we have not discussed how to assign physical units to the lattice spacing a. This is described 
in Sect. 5.2.4. 
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The last fermionic discretization we wish to mention here was originally 
constructed to address another problem of Wilson’s discretization, namely the fact 
that they are not protected against the occurrence of zero modes for any non-zero 
value of the bare quark mass. These unphysical zero modes manifest themselves 
as “exceptional” configurations, which occur with a certain frequency in numerical 
simulations with Wilson quarks and which can lead to strong statistical fluctuations. 
The problem can be cured by introducing a so-called “chirally twisted" mass term, 
after which the fermionic part of the QCD action in the continuum assumes the form 


= J dhe jo) Yu Dy +m + ipgyse? Wx). (5.50) 


Here, lq is the twisted mass parameter, and t? is a Pauli matrix. The standard action 
in the continuum can be recovered via a global chiral field rotation: 


Pajeda Pu) — Qu) e opem. (5.51) 


Fixing the twist angle o by requiring that tana = q/m one finds 


Sk = fex V'G)y,D, + MYE), M= Jm + p2, (5.52) 


which demonstrates the complete equivalence of the twisted formulation with 
“ordinary” QCD. The lattice action of twisted mass QCD for Nr = 2 flavours is 
defined as [26] 


SEU, V. V] 2 a* 35 V GU + mo + ipqyst!) v Go). (5.53) 


X€Ag 


Although this formulation breaks physical parity and flavour symmetries, is has a 
number of advantages over standard Wilson fermions. In particular, the presence of 
the twisted mass parameter uq protects the discretized theory against unphysical 
zero modes. Another attractive feature of twisted mass lattice QCD is the fact 
that the leading lattice artefacts are of order a? without the need to add the 
Sheikholeslami-Wohlert term [27], even though the Wilson-Dirac operator is used 
in Eq. (5.53). Although the problem of explicit chiral symmetry breaking remains, 
the twisted formulation is particularly useful to circumvent some of the problems 
that are encountered in connection with the renormalization of local operators on 
the lattice. Recent review of twisted mass lattice QCD can be found in [28, 29]. 

We wish to end this part with a few general remarks. Although we have 
discussed discretizations of the QCD action in some detail, including the most 
recent developments, many more variants of the basic types of action—including 
several different combinations of fermionic and pure gauge parts—can be found in 
the literature. This reflects the fact that the discretization is not unique. The actual 
choice of lattice action in a particular simulation will influence the convergence rate 
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to the continuum limit, the algorithmic efficiency, the renormalization properties of 
local operators, or—in the case of domain wall fermions—the extent to which chiral 
symmetry is realized. Depending on the properties of a particular discretization, the 
choice of lattice action can be optimized for the physics one wishes to study. 


5.2.3 Functional Integral and Observables 


The lattice formulation provides a regularization of non-Abelian gauge theories 
whilst preserving the gauge invariance at all stages of the calculation. This comes 
at a price, since all continuous space-time symmetries are broken explicitly and 
must be recovered in the continuum limit. Nevertheless, the lattice regularized 
theory inherits all consequences of gauge invariance, including renormalizability. 
Moreover, the lattice regularizes the theory without any reference to perturbation 
theory. By contrast, in continuum schemes like the MS scheme of dimensional 
regularization the cutoff is only defined after fixing the order of the perturbative 
expansion. As we shall see below, observables in lattice QCD are directly given 
in terms of functional integrals, which can be evaluated stochastically using Monte 
Carlo integration. In this way, any use of perturbation theory is completely avoided. 
For concreteness, let us assume that we have made a particular choice for 
the Yang-Mills part SG[U] and the fermionic part SE[U, W, V], for instance, the 
Wilson plaquette action and Wilson fermions. Let €2 denote an observable, which is 
represented by a polynomial in the quark and antiquark fields and the link variables. 
The expectation value, (Q2), is defined through the Euclidean functional integra? 


1 - » 
(Q) = z | Pbi viec simon, (5.54) 
where Z is fixed by the condition(f) — 1. The functional integral involves an 


integration over the gauge group and over all fermionic degrees of freedom, the latter 
being represented by anti-commuting (Grassmann) variables. Since the fermionic 
action, Sp[U, V, w] is bilinear in the quark and antiquark fields, the integration over 
the Grassmann variables is Gaussian and can be performed analytically. This yields 


3 
1 z 
=z f [[ [40 È tdet Dia} e75. (5.55) 
xEAg u=0 


3Here and in the following we drop the subscript “E” on the partition function Z. 
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Equation (5.55) requires some further explanation: 


e Q denotes the representation of Q in the (effective) theory, where the quark fields 
have been integrated out and only the link variables remain in the functional 
integral measure; 

* Da denotes a generic, massive lattice Dirac operator. For instance, for Wilson 
quarks one has Dia; = Dy 4- mo. For simplicity we have displayed the expression 
for QCD with Ny flavours of equal mass mo, which accounts for the power Nr. 
In the case of non-degenerate quarks (det Djat}** must be replaced by a product 
of determinants, in which each factor represents the contribution from a single 
flavour: 

* The lattice formulation has given a well-defined meaning to the measure D[U ]. 
The integration over the gauge degrees of freedom reduces to a finite-dimensional 
integration over the gauge group, based on the invariant group (Haar) measure. 


The numerical evaluation of (32) via Monte Carlo integration proceeds as follows. 
One starts by generating a set of gauge configurations using a computer program. 
One configuration in the set represents the collection of all link variables on a given 
lattice, 1.e. 


IU CÓ [x € Ag, dium 0,3]. (5.56) 


for which we shall use the shorthand {U,,(x)} below. A collection of an infinite 
number of configurations is called an ensemble. The statistical weight, W, of an 
individual configuration is given by 


W = {det Dj4])"* e- 56071. (5.57) 


In other words, the composition of the ensemble is determined by a probability 
distribution, which is given by the negative exponentiated classical action in the inte- 
grand of the Euclidean functional integral. Owing to the weight factor, the integrand 
of the functional integral will be strongly peaked around those configurations for 
which W is large. This particular feature makes the expectation value amenable to a 
Monte Carlo treatment. The key idea is to replace the ensemble by a finite sample of 
Nefg gauge configurations, which is dominated by those configurations for which W 
is large. Provided that one can construct a suitable algorithm, the sample will then 
consist predominantly of those configurations which give a large contribution to the 
Euclidean functional integral and thus (€2). Such a procedure is called importance 
sampling. 

Technically, the sample is produced by generating a sequence of configurations 
via a Markov process: 


{Uw} — [Oh — ...— [UsC)]y, 


cfg i 


(5.58) 
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One assigns a probability for the transition from {U, u (x)}, to {U, u O}, E which is 
usually a function of the statistical weights of the two configurations, W; and W;+1, 
respectively. For each individual configuration in the sequence one then evaluates 
the observable, which yields the estimates Q;, i = 1,..., Ncrg. The expectation 


value (Q) is related to the mean value Q via 


Netg 
— = 1 
Q)= lim Q, Q = Qj. 5.59 
(Q) = lim. Ta i (5.59) 


In other words, in the limit of infinite statistics the mean value converges to 
the ensemble average which is identical to the expectation value. An important 
consequence of approximating the ensemble average by the sample average is a 
non-zero value of the variance. Hence, in order to specify the results from a Monte 
Carlo integration completely, one must also quote the statistical error which is given 
by the square root of the variance. 

In the standard algorithms that implement Markov processes (such as the 
Metropolis algorithm [30]), the transition probabilities for going from one con- 
figuration to another are determined by comparing the statistical weights for local 
variations in the field variables. This guarantees computational efficiency, since the 
variation of individual link variables does not involve global information from the 
entire lattice. In Eq. (5.55) the dynamical effects of the quark fields are incorporated 
via the determinant of the lattice Dirac operator. The determinant, however, is a 
non-local object, which is expensive to compute. When the first efforts were made 
to compute observables in QCD in the 1980s, the available computer power did 
not allow for the inclusion of the quark determinant. Instead, lattice physicists 
resorted to what is known as the “quenched approximation", which is based on the 
assumption that the bulk of non-perturbative contributions is carried by the gauge 
field, so that the determinant is set to a constant: 


Quenched approximation: det Da =1 & M=0. (5.60) 


The resulting gain in computer time amounts to several orders of magnitude. In the 
quenched approximation the effects of virtual quark loops are entirely suppressed. 
As a consequence, results for observables are afflicted with an unknown systematic 
error. As we shall see later, there are several quantities (for instance, the masses 
of the lightest hadrons) for which the quenching error amounts to just 10-15%. 
Although this justifies the use of the quenched approximation to some extent, it is 
clear that dynamical quark effects must be taken into account, in order to arrive at 
reliable, non-perturbative predictions with a total accuracy at the percent level. 
Modern algorithms for dynamical quarks, such as the Hybrid Monte Carlo algo- 
rithm [31], do not evaluate the quark determinant directly. Rather, one exploits the 
property that the determinant can be rewritten as a functional integral over bosonic 
fields, which is then evaluated stochastically. Thereby one avoids computing a 
global object, but the computational effort involved in the stochastic estimation 
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of the quark determinant is still large compared with the quenched approximation. 
More details can be found in Sect. 5.2.6 below. 

Correlation functions, i.e. the expectation values of polynomials in the quark and 
gluon fields, are the most important quantities, since they determine implicitly the 
particle spectrum of the theory. As was discussed already in Sect. 5.2.1, the link 
between correlation functions and the particle spectrum is provided by the transfer 
matrix T. For lattice QCD with Wilson fermions, the existence of a positive transfer 
matrix was rigorously established [32]. 

As a concrete example we shall discuss the two-point correlation function of a 
charged kaon. A polynomial of quark fields with the quantum numbers of the kaon 
is given by 


k(x) = (uyss) (x), (5.61) 


where the parentheses indicate summation over spinor and colour components of the 
fields. Mostly one is interested in correlation functions in which all spatial points 
have been summed over and which therefore only depend on the Euclidean time 
separation. We define 


CkGo: P) = Y 7 e? (ox coo 0). (5.62) 


X 
The inclusion of the phase factor in conjunction with the summation over X amounts 
to a projection onto spatial momentum p. On a finite lattice with periodic boundary 
conditions Cx (xo; p) must be symmetric under x9 « T —xo. Therefore, the spectral 
decomposition of Cy (xo; p) reads 


Gm ie x E eJ [ere 4 entro] (5.63) 


where the sum runs over all states in the kaon channel with fixed momentum p, and 
€ (p) is the mass gap (see Sect. 5.2.1).^ For large Euclidean times xo the ground 
state dominates. If we further set p = 0, then the asymptotic form of the two-point 
function reads 


|(o&x.co)| K]" 


mK 


lim Ck(xo; p) = e "KT cosh (mk(T/2 — xo)), (5.64) 
X9— 00 


where my = eo(p)| p=0 is the mass of the kaon, and the sum of the two exponentials 
has been re-expressed using the cosh function. Owing to the ordering eo(p) < 
€1(p) < ..., the higher excited states are exponentially suppressed. The functional 


^In the commonly normalization of hadron states one includes a factor 2€, (p) in the denominator. 
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form of Eq. (5.64) is nicely illustrated by the plot in Fig. 5.3, where simulation data 
for Ck(xo; p = 0) are compared to its asymptotic form. The data show indeed the 
expected cosh-behaviour. Furthermore, one observes how the contributions from 
higher excited states, which are clearly visible at small values of xo/a, quickly die 
out as the time separation increases. From the two-point function we can extract 
two important quantities: the fall-off of Ck(xo; p = 0) is characteristic of the 
kaon mass, i.e. the energy of the ground state. Moreover, the pre-factor of the cosh- 
function yields the transition amplitude between a kaon state and the vacuum, and 
thus contains information on the kaon’s decay properties. 


5.2.4 Continuum Limit, Scale Setting and Renormalization 


In Sect.5.2.2 we have discussed how to discretize the QCD action. The main 
principle for their construction was the condition that the corresponding expressions 
reproduce the continuum action in the formal limit a — 0, regardless of the values 
of the bare parameters (such as 6 and the hopping parameter « in the case of QCD 
with Wilson fermions). If one goes beyond the classical theory this is not possible 
anymore: it is a general property of quantum field theory that the parameters of 
the regularized theory (masses and couplings) must be adjusted as the regulator is 
removed. In the context of lattice QCD this implies that the continuum limit, a — 0, 
is reached by a suitable tuning of the bare parameters. 

To make this statement more precise, we shall invoke the close connection 
between Euclidean lattice field theory and a system in statistical mechanics. Models 
in statistical physics (think of the Ising model as an example) usually have a phase 
structure. Depending on the choice of parameters, the different phases may exhibit 
entirely different physical properties. The analogy with lattice field theory then 
implies that a particular discretization of QCD also possesses a phase structure in 
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the space of bare parameters (8 and «, for example). We shall now explain that 
the continuum limit of QCD is associated with a critical point in the phase diagram, 
which corresponds to a second-order phase transition. In the previous section we 
have considered hadronic two-point correlation functions, and how the mass in a 
given channel can be extracted from the asymptotic behaviour at large Euclidean 
times. Actually, this procedure yields the dimensionless combination (a M), i.e. the 
hadron mass in lattice units. In order to take the continuum limit, one must take 
a — 0, while the physical mass M must remain constant. This implies 


1 
m6 £ — 0. (5.65) 


In other words, the correlation length € diverges in the continuum limit. In the 
language of statistical physics, a divergent correlation length signals a second- 
order phase transition. The existence of the continuum limit in lattice QCD is 
therefore equivalent to the existence of a second-order transition in the space of 
bare parameters. 

For simplicity we shall now consider Yang-Mills theory on the lattice, which we 
choose to describe by Wilson's plaquette action and the bare coupling parameter 
B=6/ a The existence of a second-order phase transition corresponds to a critical 
value of the bare gauge coupling, go,-. Furthermore, it implies that the bare coupling 
£o and the lattice spacing a (or, equivalently, the correlation length €) cannot be 
varied independently when the continuum limit is approached. In this way we may 
regard the bare coupling as a function of the lattice spacing, go(a), such that 


lim go(a) = 80,c. (5.66) 
a0 


Let P be an observable, computed for a particular value of go, i.e. P = P(go, a). 
Since P is a physical quantity it must stay constant as the continuum limit is taken, 
i.e. 


d 
a — P(go,a) = 0. (5.67) 
da 
This leads to the Callan—Symanzik equation 


ext. P(go,a) = O. (5.68) 
80 


>This phase diagram must not be confused with the physical phase diagram of QCD in the plane 
defined by the temperature and the chemical potential, which is explored at heavy-ion colliders. 


Otherwise, an arbitrarily chosen value of go would always correspond the a critical point. 
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We can define the renormalization group 6-function fiat as 


0g0 
Bat(go) = -a 2, (5.69) 
da 
which describes the change in go when a is varied. Note that fiat depends on the 
choice of discretization. In perturbation theory, however, one recovers the familiar 
universal coefficients at one- and two-loop order. For gauge group SU(N) one has 


Biat(go) = —bogj — bigo + Olga), (5.70) 
where 
EIN |e pu d [25.6 oS, d 
07 uy|3 3] Gt NSN) 
(5.71) 


and Nr = 0 in pure Yang-Mills theory. Starting from the perturbative expansion of 
Piat one can integrate the Callan—Symanzik equation, which gives 


2 
avia = (bogo) "Phe! / emo [1 + oq, (5.72) 


where the integration constant Aja represents a characteristic scale of the theory. 
The above expression establishes the connection between the lattice spacing and the 
bare coupling in perturbation theory. One reads off that 


a>0 $ g— 0, (5.73) 


and hence the critical point occurs at go,; = 0. These findings are a consequence 
of asymptotic freedom. Taking Eq. (5.72) at face value one would conclude that 
the relation between P(a, go) and P(a’, go)» computed for two different values 
of the bare coupling go and go near the critical point, was simply given by the 
ratio of Eq. (5.72) evaluated for go and gy. However, actual simulations do not 
confirm this expectation. The reason for the failure to observe "asymptotic scaling", 
i.e. a variation of P(a, go) with go which is consistent with Eq. (5.72), is that the 
accessible values of go in simulations are by far not near enough the critical point, 
in order for perturbation theory to be a good approximation. 

Let P and P' be two different observables that both satisfy Eq. (5.68). Then, 
regardless of whether or not asymptotic scaling holds, one would expect the ratio 
a P (a, go)/a P'(a, go) to be equal to the physical ratio P/P” for all values of go. 
However, even this weaker scaling criterion is usually not observed, the reason being 
that the right-hand side of Eq. (5.68) is not strictly zero. Rather one has 


ð 0 
lex: = fa P(go, a) = O(a”), (5.74) 
a 880 
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Table 5.1 Most widely used discretizations of the Dirac operator and some of their properties 


Leading 
Action Doublers | artefacts | Chiral symmetry 
Wilson None O(a) Broken 
Clover None [e] (a?) Broken 
Staggered 4 O(a’) U(1) & U(1) subgroup unbroken 
Neuberger None O(a’) | Preserved 
Domain Wall None O(a?) Remnant breaking exponentially suppressed 
Twisted Mass Wilson | None O(a) Broken l MEN 


where p is a positive integer. These so-called scaling violations on the right- 
hand side depend both on the lattice action and the observable in question. As a 
consequence, the ratio considered above behaves like 


aP(a, go) _ 


ee cour P 
Ga O(a"). (5.75) 


In other words, as go is tuned towards zero, dimensionless ratios of observables 
converge to the continuum limit with a rate proportional to a”, where the power p is 
characteristic of the particular discretization employed in the lattice calculation. In 
Table 5.1 we have already listed the leading scaling violations (lattice artefacts) for 
several widely used fermionic lattice actions. Discretizations of the Yang—Mills part, 
such as the plaquette action, have leading lattice artefacts of O(a”). The Symanzik 
improvement programme allows to construct lattice actions with an accelerated rate 
of convergence to the continuum limit. 

In actual lattice calculations, the continuum limit must be taken by performing 
simulations at several different values of the lattice spacing and extrapolating the 
results to a = 0. The functional form of the extrapolation is chosen such that it is 
consistent with the leading discretization errors for a given lattice action. Such a 
procedure is only viable if the relation between the lattice spacing in physical units 
and the dimensionless coupling parameter go (which is an input parameter in the 
simulation) is known with good accuracy. Since the perturbative formula Eq. (5.72) 
is not of any practical use, the relation between the scale and the coupling must be 
mapped out non-perturbatively. To this end one picks a value for go and computes 
in a Monte Carlo simulation a dimensionful quantity Q, whose value is known from 
experiment. Common choices for Q in the pure gauge theory are the string tension 
or the hadronic radius ro [33, 34], while in full QCD one may choose the mass of 
the nucleon. The Monte Carlo procedure yields Q in lattice units, (a Q), and the 
calibration of the lattice spacing is achieved via 


Qlexp [MeV] 


a`! [MeV] = 
(a Q)| 20 


(5.76) 
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Fig. 5.4 Continuum extrapolation of the dimensionless ratio of quark masses and the kaon decay 
constant [35] 


Knowledge of (a Q) over a range of bare couplings is a prerequisite for performing 
the continuum extrapolation. In Fig. 5.4 we show a particular example, namely the 
continuum extrapolation of the combination M; + 3M, + M4) of quark masses, 
normalized by the kaon decay constant, computed using O(a) improved Wilson 
fermions in the quenched approximation [35]. The expected linear convergence in 
a? is clearly exhibited by the lattice data. 

So far we have restricted the discussion to the pure gauge theory which contains 
only one bare parameter, the gauge coupling go (sometimes expressed in terms 
of B = 6/ 85). When quarks are incorporated, the set of parameters must be 
enlarged by the values of the bare masses, one for each flavour. Lattice QCD is 
thus parameterized by the set of bare parameters 


{203 Mu, mq, ms, mc, mp, Mt}. 


In order to be predictive, the theory must be renormalized, by expressing the bare 
parameters in terms of renormalized ones. 

A convenient and practical method for lattice QCD is based on so-called hadronic 
renormalization schemes. Here the bare coupling and quark masses are eliminated 
in favour of renormalized quantities such as hadron masses or decay constants. An 
example how this works in the pure gauge theory was already given in the preceding 
discussion on scale setting, where the bare coupling was eliminated by assigning a 
value in physical units to the lattice spacing. In the process one has to choose a 
quantity that sets the scale and which cannot be predicted anymore. 

Replacing the values of the bare quark masses m,,, mq, . . . in favour of hadronic 
quantities works as follows. Like the bare coupling, the bare quark mass is an input 
parameter for the simulation and thus freely adjustable. Therefore, simulations yield 
hadron masses (in lattice units) as a function of the input quark masses. For instance, 
amps(m|, mo») denotes the mass in lattice units of a generic pseudoscalar meson 
composed of a quark and antiquark with bare masses mı and mz, respectively. Let 
us assume that the lattice spacing a has been calibrated using some input quantity 
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Q. If we further assume exact isospin symmetry we can then determine the value of 
the bare isospin-symmetrized light quark mass m = iom u + mq), by requiring that 


mps(mi,m2) — ms 


Q Q ; mı = m, (5.77) 


exp 


i.e. the value of m is fixed by adjusting the input mass m, until mps (m1, m2)/Q 
coincides with the experimental result. We can extend this procedure to include 
more massive flavours. The bare strange quark mass is found by tuning m2 such that 


mps(A,m;) mk 


5.78 
7 0 (5.78) 


exp 


Alternatively one can fix m; via the condition my (rft, m2)/ Q = m% / Q|exp, where 
my denotes the mass in the vector channel. An example of a particular hadronic 
renormalization scheme is shown below: 


Parameter Quantity 
$0 fn 

1 

7 (m, T mq) |My 

Ms my 

Me mp, 

Mp mg 


All quantities in a lattice calculation are genuine predictions, except for those that 
are listed in the right-hand column of the table, which are used to eliminate the bare 
parameters. 

Given the multitude of hadronic states, it is obvious that there is considerable 
freedom in choosing hadronic renormalization schemes. Usually, masses or mass 
splitings of hadrons that are stable in QCD are suitable to define a scheme. 
Resonances, such as the o, should be avoided, since they do not have a sharply 
defined energy, owing to their large width. 


5.2.5 Limitations and Systematic Effects 


The lattice formulation is the basis for an exact non-perturbative treatment of QCD. 
The accuracy of lattice results is chiefly limited by the algorithmic performance and 
the available computer power. In particular, the set of bare parameters that can be 
simulated efficiently for a given number of lattice sites is restricted. This has the 
important consequence that the quark masses at the very extremes of the physical 
mass scale (i.e. the up/down quarks and the b-quark) cannot be simulated directly 
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with currently available methods and machines. These technical limitations are 
usually translated into a systematic error, which is quoted alongside the statistical 
one. The most important systematic effects are due to 


* lattice artefacts (cutoff effects), 
* finite volume effects, and 
* extrapolations in the quark mass. 


In order to have sufficient control over these effects, the simulation parameters must 
be chosen such that the following inequalities are satisfied: 


1 L 2 
<=, Mhad Ka, (5.79) 
aMhad a 


where haa is the mass of a generic hadron in physical units computed in the 
simulation. The inequality on the left of (5.79) states that the hadron’s correlation 
length must be much smaller than the linear extent of the spatial box (in lattice 
units), as otherwise its value will be strongly distorted by finite volume effects. The 
inequality on the right states that the hadron mass must be significantly smaller 
than the inverse lattice spacing. If this is not the case, lattice artefacts will be 
uncontrollably large, meaning that the presence of higher-order cutoff effects cannot 
be excluded, so that a reliable extrapolation to the continuum limit as a linear 
function of the leading power of lattice artefacts cannot be performed. With current 
algorithms and machines, lattice sizes of up to L/a = 48 and lattice spacings down 
to 0.05 fm are affordable, even if dynamical quarks are included. Since a = 0.05 fm 
corresponds to a^! ~ 4 GeV, it is obvious that the b-quark mass is too large to be 
simulated directly. Several techniques have been devised to address this problem, 
and a brief account can be found in Sect. 5.7.2. 

In the light quark sector, the primary limitation that forbids making direct 
contact with the physical values of the up and down quarks is mostly due to 
algorithmic performance, rather than finite size effects. A detailed discussion of 
the algorithmic difficulties associated with the simulation of light dynamical quarks 
is presented separately in the following section. Moreover, it is difficult even in 
the quenched approximation to reach quark masses significantly smaller than half 
the physical strange quark mass, in particular with Wilson fermions. This is related 
to the occurrence of arbitrarily small eigenvalues in the spectrum of the Wilson- 
Dirac operator, even for small but non-vanishing values of the bare mass. As a 
result, observables computed on individual, so-called “exceptional” configurations 
may differ from the Monte Carlo average by orders of magnitude, and thus a 
reliable determination of the result and its error is virtually impossible. As already 
mentioned in Sect. 5.2.2, the problem of exceptional configuration can be cured 
by employing alternative discretizations such as twisted mass QCD or the overlap 
operator. A related problem arising from the particular spectral properties of the 
Wilson-Dirac operator is the bad performance of standard algorithms for dynamical 
quarks, discussed in the next section. 
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Due to these reasons, many simulations (quenched and unquenched) were 
restricted to quark masses not much smaller than m;/2. This value translates 
into a minimum mass of about 490MeV in the pseudoscalar meson channel, 
so that in these simulations the pion is as heavy as the physical kaon. In this 
region of parameter space it is known empirically that a spatial lattice length 
of 2-3fm is sufficient to satisfy the first inequality in (5.79) and to rule out 
significant finite volume effects. Moreover, an important analytic result derived by 
Lüscher [36], implies that the asymptotic convergence to the result in infinite volume 
is exponential. 

In order to make contact with the chiral regime, lattice results must be extrapo- 
lated to the physical values of the up and down quark masses. The functional form 
for the dependence of observables on the quark mass is usually provided by Chiral 
Perturbation Theory (ChPT). For instance, at lowest order the relation between the 
mass of a pseudoscalar meson composed of quarks with masses mı and mz is 


ms = Bo(mi +m) +..., (5.80) 


where the ellipses represent higher orders in the chiral expansion. Similar expres- 
sions are derived for vector meson and baryon masses, e.g. 


my =m} +C M?» ..., my = m4 +kM?4..., (5.81) 


and also for other quantities such as pseudoscalar decay constants. In the above 
formulae, M? = Bo(m, + m5), and m, and my denote the (non-vanishing) masses 
in the chiral limit. A more formal introduction to the basic concepts of ChPT is 
presented in Sect. 5.6.1. 

It remains largely unknown whether or not the expressions of ChPT considered 
at a given order in the expansion can be applied in the quark mass range that 
is accessible in current simulations. Therefore, chiral extrapolations can lead to 
substantial systematic uncertainties. For instance, lattice predictions for the ratio 
of decay constants of the B and B, mesons, fp,/fg, may differ by 10%, depending 
on whether the LO or NLO expressions are used as an ansatz for the extrapolation 
from quark masses around m; /2. Currently it is estimated that pseudoscalar meson 
masses of 300 MeV and below must be reached in simulations, in order that ChPT 
at one- or even two-loop order provides an accurate prediction for the quark mass 
dependence of hadron masses and matrix elements. 

In the quenched approximation, chiral extrapolations are particularly problem- 
atic, since the chiral limit is intrinsically pathological, due to the appearance 
of singularities in the quark mass dependence. This is illustrated by the NLO 
expression for the ratio mes /(mi + m»), i.e. 


2 


m 
= om E Bo|1 — (5 — 3a0y) (dny+1)+ | eas — a5) — joo | y}, (5.82) 
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where Bo, o5, œg, ô and oq are low-energy constants. For notational convenience 
we have introduced 


M? 2 
= Gah?’ M* = Bo(m, + m»), (5.83) 
where Fo denotes the pion decay constant in the chiral limit. The low-energy 
constants ô and og, which multiply the so-called “quenched chiral logarithms”, 
have no counterpart in the unquenched case. Since 5 has a non-zero value [37], 
the quenched chiral logarithm in Eq. (5.82) gives rise to a singularity in the chiral 
limit. For many applications, the singularity can be ignored, since its effect is 
numerically small even at the physical pion mass. However, it signals that the 
quenched approximation suffers from fundamental conceptual problems. 


5.2.6 Simulations with Dynamical Quarks 


Although one may argue that the quenched approximation describes hadronic 
properties fairly well, it is clearly unsatisfactory, both from a conceptual point of 
view, and also because it introduces an unknown systematic error. Below we shall 
discuss some general issues relating to simulations with dynamical quarks. It must 
be stressed that several different techniques how to treat the quark determinant of 
Eq. (5.57) efficiently are currently being explored. A preferred or clearly superior 
method has not emerged so far, and it is likely that some of the approaches presented 
below may become obsolete in the years to come. 

In order to illustrate the main difficulties, we start by introducing the Hybrid 
Monte Carlo (HMC) algorithm [31], which has been the standard algorithm for 
simulations with dynamical quarks for many years. In order to produce one step 
in the Markov chain, the algorithm evolves the link variables according to the 
equations of motion of a classical Hamiltonian system. To this end one introduces 
a conjugate momentum variable II, (x) for every link U,,(x). The Hamiltonian is 
defined as 


3 
22.2 THU RH SI Sn dad (5.84) 
€ Ap 1,—0 


where SG[U] is the lattice gauge action, and SEIT , 9*, $] denotes an effective 
lattice fermion action, which is obtained by rewriting the quark determinant as a 
functional integral over complex bosonic fields $ (x) and $* (x). Explicitly, for Nr = 
2 one has 


(det Dia)? = f DIB". 1 exp | - Yo w [hD] G85) 


xe€Ag 
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For each step in the Markov chain, the conjugate momenta are drawn randomly from 
a Gaussian distribution (“momentum refreshment”). The Hamiltonian H[U, IT] 
governs the dynamics of the variables U,, (x) and II, (x) with respect to “simulation 
time" t, which parameterizes the evolution of U,,(x) and II, (x) as the simulation 
algorithm progresses. The evolution is described by Hamilton’s equations, which 
read 


d d 
ac 0s 09 = My (x)U p(X), az o = —Fa,u(x) — Fry), (5.86) 
where 
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(5.87) 


are the forces associated with the gluon and quark fields, respectively. The algorithm 
then proceeds by integrating the equations of motion numerically. As in any 
numerical integration scheme, the total time interval is divided into a number of 
sub-intervals of finite length At, which is called the step size. Starting from an 
initial gauge configuration (U,, (x)) and a set of conjugate momenta {IT,,(x)}, one 
obtains new sets {U " (x)}, m, (x)} after the integration. In the language of classical 
mechanics, the variables U„(x) and IIj(x) evolve along a trajectory in phase 
space which connects the initial and final configurations. However, since numerical 
integration is not exact, owing to the finite step size, the energy is not conserved. 
In the HMC algorithm this is rectified by introducing a global accept/reject step: if 
AH denotes the energy difference between the initial and final configurations, i.e. 


AH = H[U', W] — H[U, IIT], (5.88) 
then the new configuration {U " (x)) is accepted with probability’ 
P(U > U') = min(1, e^ ^P), (5.89) 


In other words, a configuration (U " (x)} associated with a large value for the energy 
violation AH is less likely to be accepted. This final step completes the Monte 
Carlo update. The name “Hybrid Monte Carlo” reflects the fact that one combines 
a deterministic classical dynamics procedure with a pseudo-random accept/reject 
step. 

One major problem which has plagued simulations with dynamical quarks over 
many years is the fact that the efficiency of the conventional HMC algorithm 


—AH 


7In practice this is achieved by drawing a random number r, with 0 < r < 1. Ifr < e the new 


configuration is rejected. 
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deteriorates sharply when the lattice spacing is decreased and the masses of the 
light (up and down) quarks are tuned to their physical values. The poor scaling 
behaviour is driven by the condition number of the lattice Dirac operator Dia, 
i.e. the ratio of the largest to the smallest eigenvalue. This quantity is known to 
grow inversely proportional to the lattice spacing and the quark mass. In particular, 
the HMC algorithm scales with the second, perhaps the third power of the light 
quark mass. Thus, simulations based on the Wilson-Dirac operator were found to be 
unpractical for lattice spacings below 0.1 fm and quark masses significantly smaller 
than half of the strange quark mass.? This is related to the afore-mentioned fact that 
even the massive Wilson-Dirac operator is not protected against arbitrarily small 
eigenvalues. Its condition number may thus fluctuate strongly in the course of the 
simulation, leading not only to numerical instabilities, but also to large fluctuations 
in the quark force term Fr, , (x), and, in turn, AH. In order to keep a reasonably large 
acceptance rate of well over 75%, one must reduce the step size At accordingly, 
and thus the numerical effort to integrate the equations of motion for an interval t 
of fixed length, increases. 

Two basic strategies to address this problems have been followed: the first is 
based on using fermionic discretizations that avoid the problem of arbitrarily small 
eigenvalues, while the aim of the second approach is to improve the simulation 
algorithms. 

Staggered fermions have been advocated as a numerically more efficient alter- 
native to the Wilson-Dirac formulation: since the staggered Dirac operator couples 
one-component Grassmann fields rather than four-component spinors, fewer float- 
ing point operations are required for one application of the operator. Moreover, the 
residual U(1) & U(1) symmetry protects the quark mass against additive renor- 
malization and thus prevents the occurrence of very small eigenvalues. However, 
the fact that the staggered formulation describes four "tastes" per quark flavour 
makes a physical interpretation difficult. Technically, the degeneracy implies that 
the statistical weight of the quark determinant is too large compared with that of 
one physical flavour. An ad hoc method to compensate for this is to take fractional 
powers of the staggered quark determinant. For instance, to simulate QCD with 
a doublet of degenerate up and down quarks with mass m, and a single heavier 
(strange) quark with mass ms, the probability measure is taken as 


1 a\y1/2 1/4 .— 
P = 7 {det (Dag + 8)] ^ [det (Digg + m;)] ^ e 5901, (5.90) 
where Dstagg is the massless staggered Dirac operator. This procedure is known as 
the "fourth root trick". The main question, which has been hotly debated, is whether 
or not the rooted staggered operator corresponds to a local field theory, or whether it 
induces spurious interactions among the fermionic degrees of freedom, which might 


lead to a violation of the universality of the continuum limit. A thorough analysis 


*This should be compared to the physical mass ratio of m ~% m, /24 [38]. 
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of this problem was given in [39], but so far no firm conclusion has been reached. 
Nevertheless, the probability measure Eq. (5.90) and the “rooting trick” it is based 
on, have been employed in large-scale simulations (see, e.g. Ref. [40]). 

Discretizations based on twisted mass QCD have also been proposed as a 
numerically more efficient quark action. Here, the twisted mass parameter jq 
protects the operator against arbitrarily small eigenvalues. The smallest mass in the 
pion channel that has been reached with this formulation was as low as 300MeV 
[41]. This corresponds to a physical quark mass of about ms/5, which may be 
sufficient to enter the regime where the quark mass behaviour of observables can 
be described analytically using Chiral Perturbation Theory. 

Owing to several major algorithmic improvements, simulations based on the 
Wilson-Dirac operator can now be performed much more efficiently. Without going 
into much detail, we simply state that most of the gain is due to the use of suitably 
chosen factorizations of the Wilson-Dirac operator into its low- and high-frequency 
parts. The various factors are then “better conditioned”. In particular, fluctuations in 
the condition number can be controlled via a separate and optimized treatment of 
the low-energy part. In this way the step size At can be increased whilst keeping 
a reasonably high acceptance rate for fixed total trajectory length t. Algorithmic 
implementations of factorization range from Hasenbusch's “mass precondition- 
ing" [42, 43], Lüscher's domain decomposition technique based on the Schwarz 
Alternating Procedure (DD-HMC algorithm) [44], to factorizations based on mass 
preconditioning combined with rational approximations of the contributions from 
multiple pseudo-fermion fields [45]. Thanks to these developments, it appears that 
the spectral properties of the Wilson-Dirac operator are no longer an obstacle to 
the efficient simulation of lattice QCD with light dynamical quarks. At the same 
time, large-scale simulations employing the recent algorithmic improvements are 
only just starting. 


5.3 Hadron Spectroscopy 


The determination of the spectrum of hadrons, i.e. mesons, baryons, glueballs, 
and possibly "exotic" hadronic states, starting from the underlying gauge theory 
of quarks and gluons has traditionally been one of the main applications of 
lattice QCD. The róle of lattice calculations in this context is twofold: first, the 
determination of the experimentally known values of hadron masses from first 
principles represents a stringent test of QCD. Second, lattice calculations can make 
predictions for the masses of undiscovered or poorly established states. For instance, 
lattice results have been instrumental in the search for glueball candidates, and have 
also contributed significantly to the debate on the existence of pentaquarks. 

The principles of hadronic mass calculations have already been outlined at the 
end of Sect. 5.2.3: After defining a suitable interpolating operator with the quantum 
numbers of the desired hadronic channel, one computes its Euclidean two-point 
function. The mass (energy) of the ground state in that channel is then extracted 
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from the exponential fall-off of the correlation function at large Euclidean times. 
The detailed functional form of the asymptotic behaviour depends on the choice of 
boundary conditions. Thus, it is not always described by a cosh function, as in the 
example of a pseudoscalar meson on a lattice with periodic boundary conditions 
in time, c.f. Eq. (5.64). In the limit of infinite temporal lattice size T, the effect 
of the boundary conditions is sufficiently weak, so that one may approximate the 
functional form of the correlation function for a generic interpolating operator 


Phad (x) by 


Chaa(xo; p) = ) el? * (braaa O) T£? Y welpje eP, ^— (591) 


Here, the quantity wa (p) is referred to as the spectral weight of the state |o). A 
large value for the spectral weight of the ground state, wi (p), will lead to an early 
domination of the correlation function by the ground state energy. The choice of 
Qnhad in a given channel can be optimized such that 


wi) > wi. i-22,3,... (5.92) 


An optimal choice of interpolating operator is not only important to ensure a reliable 
determination of the ground state energy: In order to determine the energies in 
the excitation spectrum, the associated spectral weights must be maximized by 
specifying appropriate operators. 

Below we provide examples for interpolating operators in several mesonic and 
baryonic channels: 


K-meson: x = (sysu), (syoysu) 


K*-meson: x« = (syjū), jJ=1,2,3 
(5.93) 
nucleon : oN = abe (u^ Cys d?}uc 
A > da = Eabe{uee Cy, duc 


Here, parentheses indicate summation over spinor and colour indices, while curly 
brackets denote that only spinor indices are summed over. 


5.3.1 Light Hadron Spectrum 


The determination of the spectrum of light hadrons was historically one of the first 
attempts to compute hadronic properties on the lattice. Since the masses of the low- 
lying hadrons are known from experiment, such calculations serve as benchmarks 
to test the intrinsic accuracy of the lattice approach. 
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The quenched approximation has been widely used to compute a number of 
quantities that are of great phenomenological interest. However, these results 
are of limited value, since the inherent quenching error is left undetermined. A 
precise calculation of the masses of the lowest lying hadrons in quenched QCD 
will expose the typical magnitude of the systematic error incurred by neglecting 
dynamical quark effects. To this end, several calculations of the quenched light 
hadron spectrum, using different lattice actions, have been performed [46-51]. 

In Ref. [47], the CP-PACS Collaboration presented a comprehensive study of 
the masses of the lowest pseudoscalar and vector mesons, as well as octet and 
decuplet baryons. The Wilson fermion action without O(a) improvement was 
used at four different values of the lattice spacing, and a continuum extrapolation 
linear in a has been performed for all quantities. CP-PACS adopted a hadronic 
renormalization scheme in which the lattice scale was fixed using the mass of 
the p-meson. The average up and down quark mass was set using my. In order 
to fix ms, either the kaon mass ("K'-input) or the mass of the $-meson (*$"- 
input) was used. Chiral extrapolations were either based on the form expected from 
quenched Chiral Perturbation Theory at NLO (see Eq. (5.82)), or on the leading- 
order formula supplemented by a quadratic term in the quark mass. The resulting 
(small) differences in the extrapolated values were added as systematic errors in the 
final results, which are summarily displayed in Fig. 5.5. Although the lattice results 
are in remarkable overall agreement with the experimentally observed spectrum, one 
finds significant deviations. For instance, the ratio of the nucleon and the p-meson 
masses is determined as 

HN 


— = 1.143 + 0.033 + 0.018, (5.94) 


Mp 


where the first error is statistical, and the second is an estimate of systematic 
uncertainties other than quenching. The above value is 6.7% (2.5 standard devia- 
tions) below the experimental value of 1.218. Similarly, vector-pseudoscalar mass 
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splittings, such as mg» — mx, are underestimated by 10-15% (4—60 ), depending on 
whether my or mg was used to fix the strange quark mass. 

The findings reported by CP-PACS, which were based on unimproved Wilson 
fermions, have been broadly confirmed by other collaborations employing differ- 
ent lattice actions [48-51]. Thereby, the universality of the continuum limit of 
quenched QCD has been established: although different discretizations may yield 
statistically inconsistent results at non-zero lattice spacing, they converge to a 
common continuum limit, provided that the same hadronic renormalization scheme 
has been employed. The latter requirement is important, as there is considerable 
freedom in choosing a particular scheme. This leads to ambiguities in the quenched 
approximation, since different quantities are affected in different way by quark 
loops. In Ref. [51] it was found that, by using only stable or narrow states to define 
the hadronic renormalization scheme, the discrepancies between the quenched and 
experimental spectra could be shifted to the broad resonances, p, ^, N*, while the 
agreement for states like K, $, N, €2 could be improved. Yet this observation does 
not alter the conclusion that the quenched approximation is unable to reproduce the 
spectrum of light hadrons with an accuracy better than 10%. 

The obvious question is whether sea quark effects can account for the observed 
deviation between the quenched and experimental spectra. Owing to the larger 
numerical effort required to simulate QCD with dynamical quarks, unquenched 
studies have not yet reached the same level of control over systematic effects— 
notably lattice artefacts and chiral extrapolations—compared with the quenched 
benchmark [47]. Thus, a “definitive” unquenched calculation of the light hadron 
spectrum is still lacking, and thus we refrain from presenting an overview of recent 
results. 

Nevertheless, the observed tendency in all simulations performed to date is 
that dynamical quarks “do the right thing", i.e. the deviation from experiment is 
decreased. An example is shown in Fig.5.6, where continuum extrapolations of 
meson masses in the quenched and unquenched theories are compared. The plot 
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shows that the data obtained for Nr — 2 are closer to the experimental results in 
the continuum limit in comparison with their quenched counterparts. However, the 
figure also shows that the extrapolation of unquenched data is not well constrained, 
since only three data points are available. Clearly, additional simulations at smaller 
lattice spacings and quark masses are required for a solid determination of the total 
error in unquenched calculations of the light hadron spectrum. 

It should also be noted that the various discretizations of the quark action 
have complementary advantages and shortcomings. While simulations with Wilson 
quarks have in the past been restricted to quark masses not much smaller than half 
the strange quark mass for algorithmic reasons, the use of staggered fermions in 
conjunction with the rooting procedure may be afflicted with conceptual problems 
(see the discussion in Sect. 5.2.6). Domain wall and overlap fermions are per se 
more expensive to simulate. In simulations based on tmQCD the incorporation of a 
third, heavier quark flavour is quite complicated. Thus, progress in this area is likely 
to be made through the combined information from different discretizations. 


5.3.2  Glueballs 


In addition to bound states composed of a quark-antiquark pair or, alternatively, 
three quarks, QCD is also widely believed to support the existence of glueballs, i.e. 
bound states consisting mainly of gluonic degrees of freedom. Although several 
candidates for such states have been proposed (e.g. the f9(1370), fo(1500) and 
fo(1710)), the experimental difficulty consists in their unambiguous identification 
as glueballs. To this end, they need to be distinguished from “conventional” flavour- 
singlet meson resonances in the scalar channel. Predictions for the masses and 
widths of glueballs from lattice QCD provide crucial input for this task. 

The basic principles of mass calculations for glueballs in lattice QCD are the 
same as for bound states composed of quark degrees of freedom: first one must 
define an interpolating operator with the appropriate quantum numbers of the 
glueball state in question. That is, the operator must transform correctly under 
spin, parity and charge conjugation. At this point a complication arises: the lattice 
breaks all continuous space-time symmetries, such that Lorentz-invariance or—in 
the language of Euclidean field theory—rotational invariance is only recovered in 
the continuum limit. At non-zero lattice spacing the spin assignment is therefore 
ambiguous. Since the gluon field is represented by link variables, any glueball 
operator must be constructed from particular combinations of Wilson loops, i.e. 
products of link variables along closed paths on a hypercubic lattice (see Fig. 5.7). 

Operators constructed in this way transform under irreducible representations 
(IRs) of the octahedral group Op, which are conventionally labelled A1, A», E, Ti 
and 7». By computing the relations between the IRs of Oy and SU(2) one finds 
that each IR in the set (A4, A2, E, Tj, T2} corresponds to infinitely many spins in 
the continuum. For instance, A, transforms not only like a scalar (spin 0) state, but 
also contributes to spin 4 and yet higher spin states. Similarly, the lowest states to 
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Fig. 5.7 Wilson loops used in the construction of glueball operators (from Ref. [53]) 


which 7; makes a contribution are spin 1 and spin 3, while E corresponds to spins 
2, 4, 5,.... In order to fully classify lattice glueball operators, the representations 
of Oy are supplemented by the transformation properties under parity and charge 
conjugation, in full analogy with the usual J ^C-assignment in the continuum. For 
example, an operator labelled AT" corresponds to the scalar channel 0** in the 
continuum. 

The above discussion implies that the two-point correlation function of an 
operator transforming under a2 which is used to describe the scalar glueball, 
will be contaminated by contributions from a spin 4 state. However, in accordance 
with Regge theory one may expect that the latter dies out quickly, since higher spin 
states are more massive. 

Another technical complication arises from the empirical observation that the 
spectral weight, wi(p), of the ground state in Eq. (5.91) is usually quite small. 
This implies that the asymptotic behaviour of the two-point correlation function is 
only isolated at large Euclidean times. However, the statistical accuracy deteriorates 
quickly as xo is increased, and in the asymptotic regime the correlation function is 
numerically comparable to the statistical noise. This precludes a precise determina- 
tion of the mass of the ground state. A heuristic explanation for the small spectral 
weight can be given by noting that the operators constructed from the usual link 
variables are point-like and thus have little projection onto an extended object such 
as a glueball. The situation can be much improved if the links in the Wilson loops of 
Fig. 5.7 are replaced by so-called “smeared” or “fuzzed” links [54, 55]. For instance, 
the approach of [54] replaces the spatial link U; (x) by the combination 


3 
Uj(x) =U — PiU aa M OUE Fak)U(G aj | j= 1,2,3, 
rk-lkzj 


(5.95) 


where a is a real, tunable parameter, and the symbol 7? denotes the projection back 
into the group manifold of SU(3). The procedure can be iterated, so that links at 
smearing level s, i.e. U; (x), are constructed from those at level s — 1 via Eq. (5.95). 
One may say that smearing reduces the UV fluctuations of the gauge field, so that 
the smeared, extended link variables are better suited to project onto the IR regime, 
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i.e. the long-distance properties. It should be stressed that the links in the temporal 
direction do not undergo the fuzzing procedure: Fuzzed temporal links will alter the 
transfer matrix and the spectral information it contains. 

In order to obtain detailed information on the glueball spectrum one also seeks to 
determine the masses of the excited states in a given channel. This requires another 
level of refinement, since one normally hopes that excited state contributions die out 
quickly, while they now become the very focus of interest. A widely used method to 
gain information on the higher excitations is to construct a whole set of interpolating 
operators (O1,..., O-} in a given channel, say, A This is achieved either by 
considering different shapes of Wilson loops that share the same transformation 
properties, or by applying several different smearing levels to one particular Wilson 
loop. Thus, each individual member of the set {O1,..., O,} is a perfectly valid 
operator 2 2 given channel, but the projection prenésies: i.e. the associated spectral 
weights wi? for a particular state o in the spectral sum will in general be different 
for each member i = 1,...,r. One then computes the matrix 


Ci (x0) = (0:000). íjo2l..an (5.96) 


x 


whose elements consist of the correlations of all combinations of operators in the 
set. The diagonalization of the matrix correlator then yields the appropriate linear 
combination of operators which correspond to the states æ = 1, 2, ... in the spectral 
decomposition. Diagonalization is achieved by solving the generalized eigenvalue 
problem 


Cij (xop; = Ai (x0, xp) Ci&(xg)dk. — X < xo, (5.97) 


where ¢ denotes a vector, XQ is fixed, and C(xo), C (xp) denote the matrix correlators 
taken at Euclidean times xo and Xo» respectively. As shown in [56], the set of 
eigenvalues À (xo, xo) converges rapidly towards 

Ag (x0, Xp) = e C073 6. a= l,...,r, (5.98) 
where €g is the mass (energy) of the state œ in the spectral sum. 

After all these technicalities, we now report on the status of glueball calculations. 
Recent results obtained in the quenched approximation were published in [53, 57- 
60]. In Fig. 5.8 we show the results from Ref. [57]. The three lowest-lying states are 
the scalar (0**), tensor (2^ ^) and the 0^ * glueballs, whose masses are determined 
as 


mo++ = 1710(50)(80) MeV, m++ = 2390(30)(120) MeV, 
mo-+ = 2560(35) (120) MeV. (5.99) 
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Fig. 5.8 Glueball spectrum 
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Here, the first error is statistical, while the second is an estimate of systematic 
uncertainties, which is dominated by the ambiguity in the scale setting in the 
quenched approximation. 

While it is tempting to identify the experimentally established resonance 
fo(1710) as a scalar glueball in the light of the above results, the situation is 
more complicated. Since lattice predictions for the mass of the lightest glueballs 
fall into the mass range of conventional scalar mesons, mixing of glueballs with 
conventional qq states in conjunction with the observed decay patterns must be 
considered before drawing any definite conclusions. More details on the current 
phenomenological and experimental situation can be found in [61, 62]. So far, there 
have been only exploratory attempts to study glueball-meson mixing directly on 
the lattice. Any meaningful investigation must inevitably include dynamical quark 
effects, whose influence on the glueball spectrum have so far only been poorly 
understood. 


5.4 Confinement and String Breaking 


The empirical fact that quarks and gluons are not observed as free particles is 
commonly referred to as confinement. Since all experimentally observed states 
are singlets under SU(3)colour, confinement is tantamount to saying that isolated 
colour charges are not allowed. A theoretical understanding of this phenomenon 
must inevitably go beyond the perturbative level, since QCD is a strongly coupled 
theory. 

In Ref. [6], Wilson formulated a criterion for the confinement of colour charges 
known as the “area law". Let U (C) denote the product of link variables around a 
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Fig. 5.9 Oriented product of link variables around a rectangle of area r - t 


closed loop C on a hyper-cubic lattice. The trace over colour indices is called the 
“Wilson loop", i.e. 


W(C) = tr(U(C)). (5.100) 


The area law then states that colour charges are confined if the expectation value of 
W(C) decays exponentially with a rate proportional to the area A(C) enclosed by 
the curve C, i.e. 


(W(C)) = (tr(U(C)) x e-7^€, (5.101) 


where c is a constant. An example for a rectangular Wilson loop is shown in Fig. 5.9. 

The interpretation of the area law rests on the observation that a Wilson loop 
of area r-t is equal to the Euclidean correlator which describes the propagation of 
a static, i.e. infinitely heavy, quark-antiquark pair separated by a distance r over a 
Euclidean time interval t. If t is taken to infinity at fixed r, the correlator yields the 
energy of the quark-antiquark pair: 


qw (o) Z e- vor. (5.102) 
The area law then implies c A(C) — V (r)t, and for a rectangular loop one obtains 
V(r) ~or. (5.103) 


Hence the energy of a static quark-antiquark pair increases linearly with the 
distance r. To achieve a full separation of static colour sources would therefore 
require an infinite amount of energy. 

It has long been believed that SU(3) gauge theory is related to some kind of 
string theory. Heuristically, confinement may be viewed as due to the formation of 
a narrow tube of chromo-electric and -magnetic flux between static colour charges, 
the dynamics of which can be described by a string theory. The bosonic string model 
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yields an asymptotic expansion for the static quark potential 
Vr) = or Vo  & 01/75), (5.104) 
where Vo — const, and the universal coefficient c has been computed as [63] 
c= -— (5.105) 


in the four-dimensional theory. The proportionality factor o is called the “string 
tension". Instead of the potential one often considers the force, F(r) = dV(r)/dr. 
The ansatz Eq. (5.104) yields 


F(r) =o - 5 + Or), (5.106) 
r 
so that the string tension is obtained as the limiting value of the force, as r — oo, 
c = lim F(r). (5.107) 
roo 


String models of hadrons have been known since the late 1960s, and a phenomeno- 
logical value for o has been determined from Regge theory, ./o = 440 MeV. 

In QCD with light sea quarks the linear rise of the potential cannot persist for 
arbitrarily large distances. Instead, the creation of a light quark-antiquark pair from 
the vacuum will cause the hadronization of the static colour charges, leading to 
the formation of two static-light mesonic states. Thus, the string or flux-tube is 
expected to “break” when the two-meson state is energetically favoured over the 
linearly rising potential. The breaking of the string should set in at a characteristic 
value for the separation distance, rp, causing the potential to flatten off for r Zr, 
since the energy of a state of two mesons is independent of their separation. 

Lattice simulations have been instrumental for establishing that the area law, the 
string picture of confinement, as well as string breaking (i.e. hadronization) are 
indeed properties of SU(3) gauge theory and/or QCD. However, computations of 
large Wilson loops in lattice simulations suffer from the same problem encountered 
in glueball mass calculations: due to the strong exponential fall-off, the correlator 
in the asymptotic region, r,t — oo, is of the same order of magnitude than the 
statistical noise. Consequently, the same techniques have been applied, namely 
the smearing of link variables and the variational approach, which is based on 
the diagonalization of a matrix correlator. By combining these techniques with 
procedures designed to reduce statistical fluctuations [64] in the computation of 
large Wilson loops, one could verify the linear rise of the potential up to distances 
of r < 1.5 fm [65, 66] (See Fig. 5.10). 

Since a phenomenological value for ,/o could be inferred from Regge theory, the 
string tension used to be a popular quantity to set the lattice scale. However, as lattice 
calculations became increasingly precise, it was realized that the extrapolation 
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Fig. 5.10 Left panel: static quark potential in SU(3) gauge theory (from Ref. [66]). Right panel: 
force (from Ref. [65]) compared to the bosonic string model (dashed curve) and perturbation theory 
(solid curve). To compare results at different lattice spacings, all dimensionful quantities have been 
expressed in units of the hadronic radius ro = 0.5 fm (see text) 


r — œ is not easy to perform on the basis of lattice data restricted to r < 1.5 fm. 
An alternative, conceptually much more reliable scale is obtained from the force 
between static colour charges [33]. The hadronic radius ro is defined by requiring 
that the force F(r) evaluated at r — ro assumes a given reference value. The latter 
is fixed by matching F(r) to phenomenological, non-relativistic potential models 
for heavy quarkonia. The scale ro is defined as the solution of 


F(r)r? — 1.65, (5.108) 
r 


=ro 


where the constant on the right-hand side is chosen such that ro has a value of 
r = 0.5 fm in QCD. Choosing rg to set the scale avoids the systematic uncertainty 
associated with the extrapolation of the force to infinite distance. Furthermore, ro 
remains well-defined in QCD with dynamical quarks, where string breaking must 
occur and the concept of a string tension as the limiting value of the force is 
intrinsically flawed. The quantity ro/a has been determined numerically with good 
statistical accuracy over a wide range of bare couplings, corresponding to lattice 
spacings between 0.026 — 0.17 fm [34, 65]. 

To test whether the bosonic string model for confinement is consistent with lattice 
data, one must confront the value of the Coulombic coefficient c in Eq. (5.104) with 
the predicted value of c = —z/12. As in the case for the string tension, such a 
comparison is difficult to perform reliably, since —77/12 represents the asymptotic 
value at infinite distance, which must be determined from data computed over a 
narrow range of accessible distances. Using highly accurate data for the potential 
V(r), generated by an algorithm which allows for an exponential suppression of 
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statistical fluctuations at large r and t, it could be shown [67] that the quantity 


,QV(r) 
dr? 


1 
Ceff (r) = rd (5.109) 


indeed converges towards the predicted value of —x /12. This result confirms the 
string picture of confinement and suggests that string-like behaviour already sets in 
at rather small distances of r Z 0.5 fm. 

The incorporation of dynamical quarks should drastically change the string 
picture beyond a characteristic scale rp, where due to qq pair creation string break- 
ing occurs, since a two-meson state is energetically favoured over the flux-tube. 
However, the static quark potential determined from Wilson loops on dynamical 
configurations typically does not show any clear signs of flattening off, even at 
distances as large as 1 fm, where one expects hadronization to set in. This is 
attributed to the Wilson loop having little overlap onto the state of a broken 
string, such that the spectral weight associated with the broken string is extremely 
small. Therefore, extracting its energy reliably would require large Euclidean time 
separations, for which the statistical signal is usually lost. 

It was thus proposed to address this problem by constructing a matrix correlator 
of Wilson loops supplemented by operators that directly project onto a two-meson 
state, and to consider their cross-correlations with the unbroken flux-tube. This 
strategy was first applied to Higgs models, i.e. non-Abelian gauge theory coupled 
to bosonic matter fields (“scalar QCD”), which are computationally much more 
efficient, whilst preserving the mechanism for string breaking to occur [68, 69]. The 
method was later extended to QCD with two flavours of dynamical quarks [70]. 
The plots in Fig. 5.11 clearly show that the ground state energy at short distances is 
linearly rising, while the first excited state (i.e. the two-meson state) is constant in r. 
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Fig. 5.11 Ground state and first excited state of the static quark potential computed using matrix 
correlators in the SU(2) Higgs model [68] (left panel) and QCD with Nf = 2 flavours of dynamical 
quarks [70] (right panel) 
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At a certain separation rp one observes a crossing of energy levels and a continuing 
flat behaviour of the ground state energy. Near the crossing point one actually 
observes a repulsion of the energy levels, which is characteristic for the breaking 
phenomenon. The diagonalization of the matrix correlator also yields information 
on the composition of the states in the spectral decomposition. Indeed, for distances 
r « rp the combination of operators describing the ground state is dominated by 
Wilson loops, whereas for r > rp, two-meson operators are the most relevant. 


5.5 Fundamental Parameters of QCD 


We have noted already that QCD is parameterized in terms of the gauge coupling 
and the masses of the quarks. In order to make predictions for cross sections, decay 
rates and other observables, their values must be fixed from experiment. As was 
discussed in detail in Sect. 4.3 , the renormalization of QCD leads to the concept of 
a "running" coupling constant, which depends on some momentum (energy) scale 
IL, and the same applies to the quark masses’: 


gu) 
4r 


Os (LL) = » My (KL), Ma(L), ms(i), meu), mp(a), mia). (5.110) 
The property of asymptotic freedom implies that the coupling becomes weaker as 
the energy scale u is increased. This explains why the perturbative expansion of 
cross sections in the high-energy domain allows for an accurate determination of as 
from experimental data. 

The scale dependence of the coupling and the quark masses is encoded in the 
renormalization group (RG) equations, which are formulated in terms of the £- 
function and the anomalous dimension T, 


8g) 


"OS = B(g), 


am) - 
ðu 


mc(g). (5.111) 


At high enough energy the RG functions f£ and t admit perturbative expansions 
according to 


BE) = —bog? — big) +..., t(g) = —dog? — dig? +... (5.112) 


Here, bo, bı and do = 8/ (Ax)? are universal, while the higher coefficients depend 
on the adopted renormalization scheme. 


° As usual we denote the running parameters by a bar across the symbol. 
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From the asymptotic scaling behaviour at high energies one can extract the 
fundamental scale parameter of QCD via 


A= Jim [Mg P], = BW). (5.113) 


Like the running coupling itself, the A-parameter depends on the chosen renormal- 
ization scheme.!° A related, but less commonly used variable is the renormalization 
group invariant (RGI) quark mass 


Mi | ir(2bog?) ^t], f=u,d,s,..., m=m(u). (5.114) 


lim 
JL — 00 
Unlike A, the RGI quark masses are scheme-independent quantities. Instead of 
using the running coupling and quark masses of Eq. (5.110), one can parameterize 
QCD in an entirely equivalent way through the set 


A, My, Ma, Ms, Mc, Mp, Mi. (5.115) 


At the non-perturbative level these quantities represent the most appropriate param- 
eterization of QCD, since their values are defined without any truncation of 
perturbation theory. 

The perturbative renormalization of QCD is accomplished by replacing the bare 
parameters with renormalized ones, whose values are fixed by considering the high- 
energy behaviour of Green’s functions, usually computed in the MS-scheme of 
dimensional regularization. However, at low energies it is convenient to adopt a 
hadronic renormalization scheme, in which the bare parameters are eliminated in 
favour of quantities such as hadron masses and decay constants (see Sect. 5.2.4). 
Since QCD is expected to describe both the low- and high-energy regimes of the 
strong interaction, one should be able to express the quantities of Eq. (5.115), which 
are determined from the high-energy behaviour, in terms of hadronic quantities. 
In other words, by matching a hadronic renormalization scheme to a perturbative 
scheme like MS one achieves the non-perturbative renormalization of QCD at all 
scales. In particular, one can express the fundamental parameters of QCD (running 
coupling and masses, or, equivalently, the A-parameter and RGI quark masses) in 
terms of low-energy, hadronic quantities. This amounts to predicting the values of 
these fundamental parameters from first principles. 


10The expressions for bg and by, as well as the A-parameter have already been shown in Sect. 5.2.4. 
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5.5.1 Non-perturbative Renormalization 


To illustrate the problem of matching hadronic and perturbative schemes like MS, 
it is instructive to discuss the determination of the light quark masses. A convenient 
starting point is the PCAC relation, which for a charged kaon can be written as 


fkmg = (Mu + ms) (0| (üyss)| K*). (5.116) 


In order to determine the sum of quark masses (m, + ms), using the experimentally 
determined values of fx and mx, it suffices to compute the matrix element 
(Olayss|K +) in a lattice simulation, as outlined in Sect. 5.2.3 (see Eq. (5.64)). The 
dependence on the renormalization scale and scheme cancels in Eq. (5.116), since 
the quantities on the left hand side are physical observables. Thus, in order to 
determine the combination (m, + ms) in the MS-scheme, one must compute the 
relation between the bare matrix element of the pseudoscalar density evaluated on 
the lattice and its counterpart in the MS-scheme: 


(uyss)ys = Zr(go. ap) tyss)a- (5.117) 


Here, u is the subtraction point (renormalization scale) in the MS-scheme. Provided 
that Zp and the matrix element of (Wyss)ja are known, one can use Eq. (5.116) 
to compute (Mu + ms)/fx, which is just the ratio of a renormalized fundamental 
parameter expressed in terms of a hadronic quantity, up to lattice artefacts. In 
Fig. 5.4 we have already shown the continuum extrapolation of this ratio. !! 

The factor Zp is obtained by imposing a suitable renormalization condition 
involving Green’s functions of the pseudoscalar densities in the MS as well as the 
hadronic scheme. Since the MS-scheme is intrinsically perturbative, in the sense that 
masses and couplings are only defined at a given order in the perturbative expansion, 
it is actually impossible to formulate such a condition at the non-perturbative level. 
In perturbation theory at one loop one finds 


2 

Zp(go. aj) =14+ 2 [Eme c] + Og). (5.118) 
4n (x 

where C is a constant that depends on the chosen discretization of the QCD 
action. Expressions like these are actually not very useful, since perturbation 
theory formulated in terms of the bare coupling go converges rather slowly, so 
that reliable estimates of renormalization factors at one- or even two-loop order 
in the expansion cannot be obtained. Thus it seems that the problem of non- 
perturbative renormalization is severely hampered by the intrinsically perturbative 


! The figure actually shows the ratio for the RGI quark masses, instead of those renormalized in 
the MS-scheme. 
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Fig. 5.12 Sketch of the matching of quark masses computed in lattice regularization and the MS- 
scheme, via an intermediate renormalization scheme X 


nature of the MS scheme in conjunction with the bad convergence properties of 
lattice perturbation theory. 

This problem can, in fact, be resolved by introducing an intermediate renormal- 
ization scheme. Schematically, the matching procedure for the pseudoscalar density 
(or, equivalently, the quark mass) via such a scheme is sketched in Fig.5.12. At 
low energies, corresponding to typical hadronic scales, it involves computing a non- 
perturbative matching relation between the hadronic and the intermediate scheme X 
at some scale uo. This matching step can be performed reliably if uo is much smaller 
than the regularization scale a~!. In the following step one computes the scale 
dependence within the intermediate scheme non-perturbatively from uo up to a scale 
u > no, which is large enough so that perturbation theory can be safely applied. 
At that point one may then determine the matching relation to the MS-scheme 
perturbatively. Alternatively, one can continue to compute the scale dependence 
within the intermediate scheme to infinite energy via a numerical integration of 
the perturbative RG functions. According to Eq. (5.114) this yields the relation to 
the RGI quark mass. Since the latter is scale- and scheme-independent, one can 
use directly the perturbative RG functions, which in the MS-scheme are known to 
four-loop order [71], to compute the relation to mygg at some chosen reference scale. 
By applying this procedure, the direct perturbative matching between between the 
hadronic and MS-schemes (upper two boxes in Fig. 5.12), using the expression in 
Eq. (5.118) is thus completely avoided. 

Decay constants of pseudoscalar mesons provide another example for which the 
renormalization of local operators is a relevant issue. For instance, the kaon decay 
constant is defined by the matrix element of the axial current, i.e. 


fimx = (0 K(tyoyss)(0)] K*). (5.119) 
If the matrix element on the right hand side is evaluated in a lattice simulation, then 


the axial current in the discretized theory must be related to its counterpart in the 
continuum via a renormalization factor ZA: 


(uyoyss) = Za(go) (t yoyss)la- (5.120) 
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Normally one would expect that the chiral Ward identities ensure that the axial 
current does not get renormalized. However, this no longer applies if the discretiza- 
tion conflicts with the symmetries of the classical action. This is clearly the case 
for Wilson fermions, which break chiral symmetry, such that the resulting short- 
distance corrections must be absorbed into a renormalization factor Za. Similar 
considerations apply to the vector current: if the discretization does not preserve 
chiral symmetry, current conservation is only guaranteed if the vector current is 
suitably renormalized by a factor Zy, which must be considered even in the massless 
theory. Unlike the case of the renormalization factor of the pseudoscalar density, ZA 
and Zy are scale-independent, i.e. they only depend on the bare coupling go. From 
the above discussion it is obvious that perturbative estimates of Z4 and Zy are 
inadequate in order to compute hadronic matrix elements of the axial and vectors 
currents with controlled errors. A non-perturbative determination of Z4 and Zy can 
be achieved by imposing the chiral Ward identities as a renormalization condition. 

Two widely used intermediate schemes, namely the Schródinger functional (SF) 
and the Regularization independent momentum subtraction (RI/MOM) schemes are 
briefly reviewed in the following. We strongly recommend that the reader consult 
the original articles (Refs. [72-75] for the SF, and [76] for RI/MOM) for further 
details. 


5.5.2 Finite Volume Scheme: The Schrodinger Functional 


The Schródinger functional is based on the formulation of QCD in a finite volume 
of size L? - T —regardless of whether space-time is discretized or not—with suitable 
boundary conditions. Assuming that lattice regularization is employed, one imposes 
periodic boundary conditions on the fields in all spatial directions, while Dirichlet 
boundary conditions are imposed at Euclidean times xg = 0 and xo = T. In order 
to make this more precise, let C and C’ denote classical configurations of the gauge 
potential. For the link variables at the temporal boundaries one then imposes 


Uk) ago = eS, Ux(x)lg-r = e"€. (5.121) 


In other words, the links assume prescribed values at the temporal boundaries, but 
remain unconstrained in the bulk (see Fig. 5.13). 

Quark fields are easily incorporated into the formalism. Since the Dirac equation 
is first order, only two components of a full Dirac spinor can be fixed at the 
boundaries. By defining the projection operator P+ = i + yo), one requires that 
the quark fields at the boundaries satisfy 


Polaco = 9G), — P-V GOL er E 
VO)P-[. om PG)» VO)PH| por = oO) (5.122) 
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Fig. 5.13 Left panel: sketch of the SF geometry, indicating the classical gauge potentials at the 
temporal boundaries. Middle panel: correlation function of boundary quark fields ¢,¢ with a 
fermionic bilinear operator in the bulk. Right panel: boundary-to-boundary correlation function 


where p, ..., 0’ denote prescribed values of the fields. The functional integral over 
all dynamical fields in a finite volume with the above boundary conditions is called 
the Schródinger functional of QCD: 


ZIC’, p^, p: C, p, p] — J oii. wie. (5.123) 


The classical field configurations at the boundaries are not integrated over. Using 
the transfer matrix formalism, one can show that this expression is the quantum 
mechanical amplitude for going from the classical field configuration (C, p, p) at 
xo = Oto (C', p', p') at xo = T. 

Functional derivatives with respect to p,..., p' behave like quark fields located 
at the temporal boundaries, and hence one may identify 


=, Oi, "=, PO--— 
= , x)= ——áe = aS x)= - a: 
5p(x) dp(x) 5p" (x) dp! (x) 
(5.124) 
The boundary fields ¢,¢,... can be combined with local composite operators 


(such as the axial current or the pseudoscalar density) of fields in the bulk to 
define correlation functions. Particular examples are the correlation function of the 
pseudoscalar density, fp and the boundary-to-boundary correlation fi 


f(x) = E Pysy V ODE Gs e*t). 


al? 


3L6 


U,V,Y,Z 


f=- (Eysi EGO) 6125) 


5 QCD on the Lattice 187 


which are shown schematically in the middle and right panels of Fig.5.13. In the 
above expressions, the Pauli matrices act on the first two flavour components of the 
fields. 

The specific boundary conditions of the Schródinger functional ensure that the 
Dirac operator has a minimum eigenvalue proportional to 1/ T in the massless case 
[73]. As a consequence, renormalization conditions can be imposed at vanishing 
quark mass. If the aspect ratio T/L is set to some fixed value, the spatial length L is 
the only scale in the theory, and thus the masses and couplings in the SF scheme run 
with the box size. The recursive finite-size scaling study described below can then 
be used to map out the scale dependence of running quantities non-perturbatively 
from low to high energies. It is important to realize that in this way the relevant 
scale for the RG running (the box size L) is decoupled from the regularization scale 
(the lattice cutoff a). It is this features which ensures that the running of masses and 
couplings can be obtained in the continuum limit. 

Let us now return to our earlier example of the renormalization of quark masses. 
The transition from lattice regularization and the associated hadronic scheme to the 
SF scheme is achieved by computing the scale-dependent renormalization factor 
which links the pseudoscalar density in the intermediate scheme to the bare one, i.e. 


(Sysu)sr (uo) = Zp(go, ano) (Sysu)ia(a). (5.126) 


A renormalization condition that defines Zp can be formulated in terms of SF 
correlation functions: 


Vii 


fp (xo) xo=T /2 


Zp(go, auo) = c ; Ho = 1/Lmax; (5.127) 


where the constant c must be chosen such that Zp = 1 in the free theory. In order 
to determine the RG running of the quark mass non-perturbatively one can perform 
a sequence of finite-size scaling steps, as illustrated in Fig.5.14. To this end one 
simulates pairs of lattices with box lengths L and 2L, at fixed lattice spacing a. The 
ratio of Zp evaluated for each box size yields the ratio msp(L)/msp(2L) (upper 
horizontal step in Fig. 5.14), which amounts to the change in the quark mass when 
the volume is scaled by a factor 2. In a subsequent step, the physical volume can be 
doubled once more, which gives msgp(2L)/msp(4L). The important point to realize 
is that the lattice spacing can be adjusted for a given physical box size. In this way 
the number of lattice sites can be kept at a manageable level, while the physical 
volume is gradually scaled over several orders of magnitude, as indicated by the 
zig-zag pattern in Fig. 5.14. Furthermore, each horizontal step can be performed for 
several lattice resolutions, so that the continuum limit can be taken. By contrast, 
if one attempted to scale the physical volume for fixed lattice spacing, one would, 
after only a few iterations, end up with systems so large that they would not fit into 
any computer's memory. 
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Fig. 5.14 Illustration of the recursive finite-size scaling procedure to determine the running of 
m(L) for L + 2L — 4L — 8L. In any horizontal step L is scaled by a factor 2 for fixed lattice 
spacing a. In every diagonal shift one keeps the physical box size L fixed and increases a by an 
appropriate tuning of the bare coupling go 


In an entirely analogous fashion one can set up the finite-size scaling procedure 
for the running coupling constant in the SF scheme, gsp (L).!? Setting a value for 
the coupling actually corresponds to fixing the box size L, since the renormalization 
scale and the coupling in a particular scheme are in one-to-one correspondence. The 
sequence of scaling steps begins at the matching scale uo = 1/Lmax between the 
hadronic and SF schemes, and in order to express the scale evolution in physical 
units, the maximum box size Lmax must be determined in terms of some hadronic 
quantity, such as fx or ro. In typical applications of the method, Lmax corresponds 
to an energy scale of about 250 MeV. After n steps, the box size has decreased by a 
factor 2" (typically n = 7 —9), and at this point one is surely in the regime where the 
perturbative approximations to the RG functions are reliable enough to extract the 
A-parameter (in the SF scheme) and the RGI quark masses according to Eqs. (5.113) 
and (5.114). The transition to the MS-scheme is easily performed, since the ratios 
Asr/Ajgg as well as myys/ M are computable in perturbation theory. At that point 
one has completed the steps in Fig. 5.12, and all reference to the intermediate SF 
scheme has dropped out in the final result. 

As examples we show the running coupling and quark mass in the SF scheme 
from actual simulations of lattice QCD for Nr = 2 flavours of dynamical quarks in 
Fig. 5.15. The numerical data points in these plots originate from simulations with 
two flavours of O(a)-improved Wilson fermions and have been extrapolated to the 
continuum limit. 


"The precise definition of gsr is specified in Sect. 5.5.5 below. 
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Fig. 5.15 Running of a; (left panel) [77] and quark mass in units of the RGI mass M (right panel) 
[78] in the SF scheme. The results from simulations (full circles) are compared to the integration 
of the perturbative RG equations 


5.5.3  Regularization-Independent Momentum Subtraction 
Scheme 


An alternative choice of intermediate renormalization scheme is based on imposing 
renormalization conditions in terms of Green's functions of external quark states in 
momentum space, evaluated in a fixed gauge (e.g. Landau gauge) [76]. The external 
quark fields are off-shell, and their virtualities are identified with the momentum 
scale. Here we summarize the basic steps in this procedure by considering a quark 
bilinear non-singlet operator Or = y 1r Vo, where I denotes a generic Dirac 
structure, e.g. T = ys in the case of the pseudoscalar density. The corresponding 
renormalization factor Zr is fixed by requiring that a suitably chosen renormalized 
vertex function Ar n (p) be equal to its tree-level counterpart: 


ArRG)| 2 = ZrZy Ar(p) LEN (5.128) 


p-u 
This condition defines Zr up to quark field renormalization. Such a prescription 
can be formulated in any chosen regularization, which is why the method is said 
to define a regularization-independent momentum subtraction (RI/MOM) scheme. 
However, Zr does depend on the external states and the gauge. 

In order to connect to our previous example of the renormalization of quark 
fields, we consider the pseudoscalar density for concreteness: T = ys = “P”. In 
this case, Ap 0 = ys ® Lcolour, and Eq. (5.128) can be cast into the form 


x 1 
Zp OV (go, au) Zy (40. 4p) 5 Tr (AP(p)ys)| = 1, (5.129) 
pray 


where the trace is taken over Dirac and colour indices. 
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In practice, the unrenormalized vertex function Ap(p) is obtained by computing 
the quark propagator in a fixed gauge in momentum space and using it to amputate 
the external legs of the Green's function of the operator in question, evaluated 
between quark states, i.e. 


Ap(p) = $(p) ! Ge(p) S(p |, S(p) = J dfx e !?* (S(x, 0)), 


Gp(p) = f dfx d* ye “PE (yr (x) (Wi (0)ysV2(0)) Y2(y)). (5.130) 


The quark field renormalization constant a ? can be fixed, e.g. via the vertex 


function of the vector current: 


(5.131) 


1 
Zy = alt [Ave (pyri } n 
pau 


The numerical evaluation of the Green’s function and quark propagators in momen- 
tum space is performed on a finite lattice with periodic boundary conditions. Unlike 
the situation encountered in the Schródinger functional, there is thus no additional 
infrared scale, so that the renormalization conditions cannot be evaluated directly 
at vanishing bare quark mass. A chiral extrapolation is then required to determine 
mass-independent renormalization factors. 

Equation (5.128) is also imposed to define the subsequent matching of the 
RI/MOM and MS schemes. In this case, the unrenormalized vertex function on the 
left-hand side is evaluated to a given oder in perturbation theory, using the MS- 
scheme of dimensional regularization. For a generic quark bilinear this yields the 


factor ZMS (gs 00). In our specific example of the pseudoscalar density operator 


in the PCAC relation, Eq. (5.116), the transition between the RI/MOM and MS 
schemes is provided by 


(üyss)gss (I) = Rp(it/ Uu) ZS (9o, ap) (üyss taa). (5.132) 


The ratio Rp admits a perturbative expansion in terms of the coupling in the MS- 
scheme, i.e. 


ZMS (aes GO) 


TB -14-R8 + OF 
Zp.OM (go, ap) 


Rp(ū/u) = gis (5.133) 


4 
E 


'3Tn this expression y denotes the conserved lattice vector current, which involves quark fields at 
neighbouring lattice sites, and which is known not to undergo any finite renormalization, such that 
Zy = 1. 
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which is not afflicted with the bad convergence properties encountered in the direct 
matching of hadronic and MS-schemes. Finally, for the whole method to work, one 
must be able to fix the virtualities u of the external fields such that 


Aoc « u X la. (5.134) 


In other words, the method relies on the existence of a “window” of scales in which 
lattice artefacts in the numerical evaluation are controlled, y «& 1/a, and where 
u is also large enough such that the perturbative matching to the MS scheme can 
be performed reliably. In the ideal situation one expects that the dependence of 
ZION dii: au) on the virtuality u inside the “window” is well described by the 
perturbative RG function. 

The RI/MOM prescription is a flexible method to introduce an intermediate 
renormalization scheme and can easily be adapted to a range of operators and 
lattice actions. In particular, the extension to discretizations of the quark action 
based on the Ginsparg-Wilson relation is straightforward. This contrasts with the 
situation encountered in the Schródinger functional, where extra care must be taken 
to ensure that imposing Schródinger functional boundary conditions is compatible 
with the Ginsparg-Wilson relation [79-8 1]. On the other hand, the non-perturbative 
scale evolution, for which the Schródinger functional is tailored, is not so easy to 
incorporate into the RI/MOM framework. Hence, the matching between RI/MOM 
and MS schemes is usually performed at fairly low scales, ie. jj = jo in the 
notation of Fig. 5.12. Furthermore, the accessible momentum scales in the matching 
of hadronic and RI/MOM schemes are typically quite narrow, i.e. auo 7 1. Special 
care must also be taken when one considers operators that couple to the pion, such as 
the pseudoscalar density. In this case the vertex function receives a contribution from 
the Goldstone pole, which for p = u = 0 diverges in the limit of vanishing quark 
mass. The fact that the chiral limit is ill-defined may spoil a reliable determination 
of the renormalization factor, in particular when the accessible “window” is narrow 
such that jz cannot be set to large values. 


5.5.4 Mean-Field Improved Perturbation Theory 


Another widely used strategy is to avoid the introduction of an intermediate 
renormalization scheme altogether and attempt the direct, perturbative matching 
between hadronic and MS schemes via an effective resummation of higher orders in 
the expansion. In this sense one regards the bare coupling and masses as parameters 
that run with the cutoff scale a~!. 

The bad convergence properties of perturbative expansions such as Eq. (5.118) 
has been attributed to the presence of large gluonic tadpole contributions in the 
relation between the link variable U,, (x) and the continuum gauge potential A,, (x). 
It was already suggested by Parisi [82] that the convergence of lattice perturbation 


theory could be accelerated by replacing the bare coupling gà by an "improved" 
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coupling g* = gi / ug, where ug denotes the average plaquette: 


u$ = 4Re (tr P), = TD Pus (5.135) 
s Vv 


A more systematic extension of the idea of setting up such a “tadpole” or *mean- 
field" improved version of lattice perturbation theory was presented in Ref. [83]. 
The main strategy is to factor out tadpole contributions through a redefinition of the 
link variable: 


U,(x) > Ü,(x) = U&(x)/uo, (5.136) 


where ug is the average link, defined e.g. via the average plaquette. A factor of uo 
is then absorbed into the normalization of the quark fields. According to [83], the 
mismatch between non-perturbative estimates for uo and its expression in lattice 
perturbation theory can be used to improve the convergence properties of lattice 
perturbation theory via a relative rescaling of quark fields in the continuum and 
lattice formulations. To make this more explicit, we consider Wilson fermions (see 
Sect. 5.2.2). Factoring out the average link uo modifies the quark field normalization 
of Eq. (5.36) according to 


y 9t (y) = J/2kupo W(x), yoo" (x) = y (x) y 2kuo. (5.137) 


The general expression for the perturbative expansion of Zp in powers of the bare 
coupling reads 


Zp(g0, an) = 1+ g5Zp (aj) + O(g), (5.138) 


where yA (au) denotes the one-loop expansion coefficient. The convergence of 
Eq. (5.138) can be accelerated by dividing out uo in the rescaling factors of the quark 
and antiquark fields using its perturbative expansion and replacing it by its non- 
perturbative estimate computed in simulations. In other words, the rescaling of the 
quark fields is exploited to divide out the relative mismatch between the perturbative 
and non-perturbative estimates for the average link in expressions like Eq. (5.138): 


(ita = uo f1 = u83 + Oi) ] (5.139) 


where the one-loop coefficient u® = —1/12 for the average plaquette. In this way, 


i.e. by combining non- vermirbanvely determined values for uo with its perturbative 
expansion, and after replacing the bare coupling by £?, one arrives at the mean-field 
improved version of Eq. (5.138), viz. 


Zp" = uo {1+ [ZP au) -up | g]. (5.140) 
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Instead of Parisi’s “boosted” coupling g other expansion parameters have been 
suggested, which are expected to accelerate the convergence of the perturbative 
series [83]. While mean-field improvement is a general procedure, which is easily 
adapted to a wide range of actions and operators, it is difficult to estimate the 
effectiveness of the resummation and, in turn, the size of higher-order corrections. 
Also, a principal problem is the identification of the running scale with the cutoff, 
since it is difficult to separate renormalization effects from lattice artefacts. 


5.5.5 The Running Coupling from the Lattice 


Having discussed the non-perturbative renormalization of QCD in detail, we shall 
now present results for the running coupling constant, œs, from two different 
approaches. This complements the discussion in Sect. 4.6, where determination of 
as from experimental data has been described in detail. Any lattice calculation of 
as proceeds along the following steps: 


1. A non-perturbative definition of the coupling must be provided in terms of 
some quantity which can be evaluated in lattice simulations with high precision. 
This amounts to specifying the running coupling in a particular renormalization 
scheme, ox(aj9), which can be related to the MS scheme of dimensional 
regularization. 

2. Scale setting: the matching to a hadronic scheme is performed via the calibration 
of the lattice spacing, which yields the scale j49 at which ox is evaluated in units 
of some physical quantity Q: 


Q [MeV] 
(aQ) ` 
3. Running and matching: provided that the energy scale at which œx has been 


determined is large enough, one can use perturbation theory to relate ox to the 
coupling in the MS scheme, e.g. 


po [MeV] = (apo) : a^! [MeV] = (apo) - (5.141) 


anys (it) = ex (u) + e Gu pex Qa)? +... (5.142) 


4. The A-parameter can be determined from the asymptotic behaviour of ax via 
Eq. (5.113). 


The attentive reader has surely noticed that the above steps follow closely the 
general strategy for non-perturbative renormalization via an intermediate renormal- 
ization scheme outlined in Sect. 5.5.1 and Fig. 5.12. 
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First we discuss the determination of o; from the Schródinger functional. The 
definition of the running coupling is somewhat technical in this case. The starting 
point is the effective action of Eq. (5.123); the classical field configurations at the 
boundaries at x9 = 0, T can be parameterized in terms of a real variable n: 


C-C(p, C=C). (5.143) 


For explicit expressions we refer the reader to the original article [84]. The 
associated effective action is defined by 


T(n) = —InZ[C'(n), 0,0; C (1), 0,0] (5.144) 


and admits a perturbative expansion in terms of the bare coupling go, viz. 


1 
Tn) = —To- T1 + gr * .... (5.145) 
80 


A renormalized coupling can then be defined in terms of the effective action via 


1 0 0 
—— = 1—Tr —T : 5.146 
INTE 5; f T CN SE 


This definition is imposed at vanishing quark mass, m = 0, and provided that the 
aspect ratio T/L has been fixed, the spatial dimension is the only scale in the theory, 
such that gsf (L) runs with the box size L. From the perturbative expansion of I (n) 
one easily infers that gag CL) = gi at tree level. The quantity on the right-hand side 
is given in terms of plaquettes attached to the SF boundaries and can be computed 
with good statistical precision. 

If Lmax denotes the largest box size for which gsp is computed, then the scale 
is set by expressing Lmax in terms of some known dimensionful quantity, for 
instance, by computing the combination Lmax/ro in the continuum limit and using 
ro = 0.5 fm. 

The finite-size scaling procedure described earlier in Sect. 5.5.1 allows to 
compute the scale evolution of gsr over several orders of magnitude. In particular, 
each of the horizontal steps in Fig. 5.14 can be repeated for several values of the 
lattice spacing, so that the continuum limit is reached by taking a/L — O0 for 
fixed physical box size L. The resulting scale evolution of asp = gi /An is 
shown in Fig. 5.15 and compared to the perturbative evolution. Although the non- 
perturbatively determined points are described very well by perturbation theory, 
using the three-loop expression for the RG function, one should realize that this 
behaviour may be specific to the SF scheme and should not be generalized to other 
schemes. 
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Starting from uo = 1/Lmax one obtains the coupling at u = 29 / Lmax after nine 
steps in the scaling procedure. At that point one can extract the A-parameter by 
evaluating the exact expression 


- —bi/(2b2) -2 8u) 1 1 b 
— 2 0 a —1/(2b08* (0) = EOS Vo Es a 
Asp = u (bog w) e zl [ dx $0 + DE yis ; 
(5.147) 


where u = 2° / Lmax. The integral can be computed using the three-loop approxima- 
tion to the RG £-function in the SF scheme. Equation (5.147) yields the combination 
AsrLmax, and knowledge of Lmax in physical units allows to express the A^- 
parameter in MeV. Conversion to the MS scheme is easily achieved, since the ratio 
of A-parameters in two different schemes is computable via a one-loop calculation 
in which En. is expanded in powers of Boe This gives 


Avis = ASF: CA. (5.148) 


The entire procedure of determining the A-parameter via the Schrödinger functional 
has so far been carried out for the pure SU(3) gauge theory (Nr = 0) and for 
QCD with two flavours of dynamical quarks. The values of the coefficient c4 are 
2.04872(4) for Nr = O0 [84] and ca = 2.382035(3) for Ne = 2 [85], and the 
resulting values for Ays are [75, 77] 


AO ro = 0.602 + 0.048 & A® = 238+ 19 MeV 
E oa (5.149) 
AO ro = 0.624 0.0440.04 & AQ) = 245+ 16+ 16 MeV, 
MS MS 
where ro = 0.5fm is used to convert into physical units. There is room for 


improvement in several respects: for Nf = 2 the extrapolation to the continuum 
limit can be made more reliable by including simulations at smaller lattice spacings, 
which should reduce the first of the two quoted errors. Also, the conversion into 
physical units should be performed in terms of a quantity such as fx, which is 
directly accessible in experiment. Finally, the calculation must be repeated with 
more dynamical quark flavours, in order to allow for a direct comparison with 
phenomenology, since all experimental determinations yield the A-parameter for 
Ng = Aor 5 quark flavours. 

The determination of a; and Ag;s via the Schrödinger functional is quite 
involved. However, it is the only method so far, which allows to map out the running 
of a; in a completely non-perturbative manner, including the systematic elimination 
of lattice artefacts. In particular, perturbation theory is used only for energy scales 
well above 50 GeV. 

The second method that we will discussed here in some detail is the determi- 
nation of o; via heavy quarkonia. Below we present an account of the calculation 
published in [86]. Here, the dynamical quark effects of the light (u, d, s) quarks have 
been accounted for in simulations with improved staggered quarks employing the 
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fourth-root trick (see Sect. 5.2.6). In this approach, the coupling constant is defined 
in the so-called *V-scheme" via the heavy quark potential in momentum space: 


An 
V(q)— ESO (5.150) 


Small Wilson loops such as the plaquette can be expanded in powers of ay 
— In 1 (Retr P) = cP ay(sp/a) + cp” [av(sp/a)P +..., (5.151) 


where sp is a real dimensionless variable which can be chosen to optimize the 
convergence properties of the expansion [83]. Equation (5.151) thus provides 
the link between the coupling and a quantity that is easily computed in lattice 
simulations. The above expression can be generalized to (small) rectangular Wilson 
loops W,; with area r - t: 


oo 


-In $ (Wr) = Y df? lov Ga/a)l . (5.152) 
k=0 


Knowledge of the expansion coefficients in conjunction with lattice data for the 
quantity on the left hand side allows for the determination of ay. 

The second step, namely the calibration of the momentum scale which appears in 
the argument of oy, is done by determining the lattice spacing from mass splittings 
in the bottomonium system. Here one typically considers the mass differences 
between the Y and Y’, or alternatively, between the x, and Y states. Of course, any 
other low-energy quantity like fx or ro could be used. It can be argued, however, that 
mass splittings in heavy quarkonia are a natural choice for setting the scale in this 
particular approach, chiefly because of their relative insensitivity to the exact value 
of the heavy quark mass. Since the b-quark mass of mp ~ 4 GeV is greater than 
typical values of the inverse lattice spacing, a^! one must employ special techniques 
to deal with heavy quarks on the lattice. In [86] this is done via an approach based 
on non-relativistic QCD. A detailed discussion of the specific treatment of heavy 
quarks in lattice simulations is deferred to Sect. 5.7.2. 

After setting the scale, the Wilson loops (W,;) computed on ensembles with 
Ny = 3 flavours of rooted staggered quarks are used to determine oy via a global fit 
involving data at three different values of the lattice spacing. This yields 


o (7.5 GeV) = 0.2082 + 0.0040, (5.153) 


where the superscript on the coupling reminds us that the result is valid in the 
three-flavour theory. The relation to the coupling in the MS-scheme at the Z-pole is 
determined in perturbation theory, by employing the third-order expansion of agg 
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in terms of ay [87]: 
(3) 7-3/6 (3) 2 7 ju 51° Biel? 
aC (75/64) = a? q) + = [ay «| — (0.3111...) k «| . — (5454) 


which yields um (3.26 GeV). This coupling is then translated to c (Mz) via the 
numerical integration of the four-loop RG f-function, including the effects from 
quark mass thresholds at m, and mp, which finally yields 


um © (Mz) = 0.1170 + 0.0012. (5.155) 


This result is included in the world average of ane (Mz) = 0.1176 + 0.002 in 
Ref. [61]. It is also in very good agreement with the non-lattice global estimate of 
ae © (Mz) = 0.1182 + 0.0027 [88]. 

“the running and matching in this approach is done perturbatively, involving 
energy scales from Mz down to mec. In this sense the method may be regarded 
as similar in spirit to, say, the determination of œs from the semi-leptonic branching 
ratio of t decays, as in both cases the coupling is extracted from the perturbative 
expansion of a particular observable. While for t-lepton decays an experimentally 
measured quantity is considered, it is the non-perturbatively computed data for the 
Wilson loops in the lattice approach which are expressed in terms of the running 
coupling. This contrasts with the Schródinger functional approach, where also the 
running is computed non-perturbatively, albeit with considerable numerical effort. 

The error on the result in Eq. (5.155) is rather small. It is left for future studies 
to confirm this level of precision, which must entail further investigations into the 
influence of lattice artefacts, as well as the validity of the fourth root trick. 


5.5.6 Light Quark Masses 


We shall now apply the general framework of non-perturbative renormalization 
to the determination of quark masses. Typically one distinguishes the "light" 
u,d,s quarks from the "heavy" c, b, t quarks. At first, this distinction may seem 
rather arbitrary. It is actually based on the relative magnitude of the quark masses 
compared with the chiral symmetry breaking scale Ay, which separates “soft” 
from “hard” momentum scales. Masses and momenta well below A, break chiral 
symmetry only softly, so that spontaneous chiral symmetry breaking still dominates 
over the explicit breaking generated by non-zero values of the quark masses. Gasser 
and Leutwyler [89, 90] have demonstrated that QCD with u, d, s flavours can be 
studied via an "effective" theory of Goldstone boson fields. This approach, called 
Chiral Perturbation Theory (ChPT), has an SU(3);, & SU (3)g chiral symmetry, 
which is spontaneously broken to the SU(3) vector subgroup. The associated 
Goldstone bosons are then identified with the pions, kaons and n-mesons, whose 
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masses are indeed small compared to typical hadronic scales, such as the mass of 
the nucleon, for instance. Thus, the magnitude of A , is identified with a value close 
to 1 GeV. In ChPT, quantities like hadron masses, decay rates or cross sections 
are computed through an expansion in powers of quark masses (and 4-momenta) 
about the chiral limit. The inclusion of the charm quark into the formalism is rather 
useless, since the masses if the lightest charmed pseudoscalar mesons are far greater 
than A, ~ 1 GeV. 

The top quark can be safely ignored in this context, since its lifetime is an order 
of magnitude shorter than typical QCD processes. As a consequence, the top quark 
does not undergo any hadronization effects (for instance, “toponium”, i.e. tt bound 
states have never been observed), but rather decays weakly into a W-boson and a 
b-quark. 

The mass of the b-quark is rather large (and to some extent this is also true 
for the charm quark), so that one may attempt to determine their values from 
perturbative expansions in a; of some mass-dependent quantity. By contrast, in 
the light quark sector non-perturbative effects such as spontaneous chiral symmetry 
breaking dominate. As far as the determination of the masses of the u, d, s quarks 
is concerned, ChPT is of limited value, since only ratios of quark masses can 
be predicted, but not their absolute values. The reason is that although the light 
quark masses appear as parameters of ChPT, their values cannot be fixed by 
chiral symmetry (see Sect. 5.6.1 for more details). The absolute normalization must 
therefore be provided by non-perturbative methods such as lattice simulations or 
QCD sum rules. 

Below we will focus on attempts to compute the values of the light quark 
masses in units of some hadronic quantity. As indicated in Sect. 5.5.1, this entails 
the knowledge of the renormalization factor that links lattice regularization to the 
chosen continuum scheme. Lattice simulations have maximum impact in the light 
quark sector, owing to the dominance of non-perturbative effects, which is in fact 
signified by the large uncertainties quoted for the values of the u, d and s quark 
masses in the particle data book [61 ]. 

The general procedure for the determination of light quark masses in lattice QCD 
starts from the PCAC relation, Eq. (5.116). Assuming exact isospin symmetry, Mu = 
ma, one can consider a generic light flavour £ with mass m, = m = 3n, c mg). 
In order to determine, say, the combination m + ms, one must define a particular 
hadronic renormalization scheme, by specifying the lattice scale and the hadronic 
quantity that fixes the value of m + ms. Furthermore, the renormalization factor 
which connects hadronic and continuum schemes must be known. Equation (5.116) 
can then be rewritten such that it yields the sum of RG-invariant quark masses M+ 
Ms in units of the quantity Q which sets the lattice spacing: 


M M bare 2 
MAS Zu (&?) x (: + O(a). (5.156) 
Q mps—my exp 


K 
bare 2 
Gps Q 


5 QCD on the Lattice 199 


In this expression, the subscript “exp” denotes the experimental values for the 


respective quantities, while the matrix element Gp is given by 


= GR" = (0 |Gyss)ia| K). (5.157) 


mps—mk 


The pseudoscalar decay constant fum parameterizes the matrix element of the 
unrenormalized axial current, i.e. 


bare 


PTS = fg™°mr = (0 |(Cyoyss)iat| K). (5.158) 


mps—mk 


The renormalization factor Zw relates the bare current quark mass to the RG- 
invariant mass. Thus, the task for lattice calculations is to compute the ratio 
INO / Gls for a generic pseudoscalar state and tune the bare quark mass such 
that mps = mx. By combining the result with the renormalization factor Zw 
and the experimental value of mg / Q*, the RGI quark masses in units of Q 
are obtained up to lattice artefacts of order aP”, where p is characteristic of the 
details of the discretization. Since the RGI quark masses are scale- and scheme- 
independent quantities, the factor Zw depends only on the bare coupling go. 
Using the Schródinger functional as the intermediate renormalization scheme, non- 
perturbative estimates of Zm computed for O(a) improved Wilson fermions within 
a wide range of bare couplings, have been published in Refs. [75] and [78]. In this 
case, Zy is given by 


M ZA(go) 


i 80) (5.159) 
msp (uo) Zp(g0, auo) 


ZM(80) = 


where the ratio M/Mmsp(uo) is computed via the finite-size scaling procedure. The 
transition between lattice regularization and the SF-scheme is accomplished by 
determining Zp and the renormalization factor Za of the axial current.!^ Note that 
the dependence on the intermediate matching scale uo drops out completely in this 
expression. Finally, the conversion to the MS-scheme is performed by considering 


myst) 


m 2M(80): (5.160) 


Zm(80, au) = 
where the ratio mggīg(4)/M can be computed through the numerical integration of 


the perturbative approximation of the anomalous dimension t and the 6-function at 
four loops. This yields [35, 78] 


(5.161) 


Mmyg(2GeV) [0.7208 Nr—0 
M ~ (0.7013, Nr—2. 


^1f the fermionic discretization preserves chiral symmetry Za = 1, while for Wilson fermions ZA 
should be computed non-perturbatively. For the SF this was performed in Refs. [91, 92]. 
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Table 5.2 Results for the strange quark mass in the MS-scheme at u = 2 GeV and for the ratio 
ms /m, in the continuum limit of the quenched approximation 


Collaboration Action | Q Renorm. PC [MeV] ms/m moo) [MeV]. 
SPQAR [93] Wilson | my» |RI 105(9)(6) —|24.3(2(6) | 95(9)(5) 
CP-PACS [47] Wilson |m,  |pert. 114(2)(8) 26.561) 98(2)(8) 
ALPHA/UKQCD [35]  |Clover | fx |SF 97(4) 99(4) 
JLQCD [94] Stagg. |m, | RI | 106(7) 25.1(2.4) | 95(6) 


The table includes information on the fermionic action employed in the simulations, the quantity 
Q that sets the scale, and the type of renormalization (RI/MOM, SF or tadpole improved 
perturbation theory. The right-most column contains the results for the strange quark mass when 
converted into a common hadronic scheme, in which the scale is set by Q' = 1/ro, assuming that 
ro = 0.5 fm 


Estimates for the strange quark mass itself can be obtained in two ways: first, one 
combines M + M; with the ratio M,/M = 24.4 + 1.4 estimated in ChPT [38]. 
Alternatively, one might attempt to compute M directly from lattice data, by 
considering Eq.(5.116) for a pion. In this case, however, one relies on chiral 
extrapolations, because of the difficulties involved when tuning the masses of the 
light quarks towards the values of the physical up- and down-quark masses. 

In Table 5.2 we present a selection of results for the mass of the strange quark 
in the quenched approximation, normalized in the MS-scheme at u = 2GeV, 
as well as the ratio M,/ M. Two observations are worth mentioning: first, direct 
determinations of M; / M via chiral extrapolations agree well with the estimate from 
ChPT, even though the chiral limit is ill-defined in the quenched approximation. 
Second, the different systematics in the simulations (lattice actions, renormalization 
of local operators) generate a spread of seemingly incompatible results for the mass 
of the strange quark. However, the spread can be traced to the particular choice of 
hadronic renormalization scheme. To this end one can compute the relation between 
quark masses computed for two different lattice scales, Q and Q'. From Eq. (5.156) 
one easily infers that the strange quark mass m2” estimated using Q’, is related to 
its counterpart m® via [37] 


me evi = (2) (£) m2) [MeV]. (5.162) 
Q lat Q exp 


Here, the subscripts “lat” and “exp” refer to lattice and experimental estimates of 
the scale ratios. The ratio (Q'/ Q)ia can be determined in the continuum limit using 
published lattice data, and the deviation of the proportionality factor from unity is a 
measure of the relative quenching effects, when either Q or Q' is chosen to set the 
scale. Once the results have been converted to the common scale ro, the estimates for 
ms in the continuum limit show remarkable consistency, despite the very different 
systematic effects among the simulations included in this analysis (c.f. Table 5.2). 
This demonstrates that lattice artefacts and renormalization effects can be controlled 
at the level of a few percent with the available techniques. 
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Table 5.3 Selection of recent unquenched results for the light quark masses 


Collaboration Ne |Actio |Q Ren. |m, [MeV] | m,;/m m [MeV] 
EE CM 2+1 |Clover |m pert. | 91.1 (4$) 3.540:$4) 
JLOCD p 6.2 0.35 
HPQCD [96] 241 |Stagg. | Ay—y | pert. |87(8) 27.4(4) 32(3) 
oe [97] 2 Clover |my,ro |RI 111(9) 27(3) 4.1(4) 
UKQCD i 

SPQ,R [98] 2 Wilson | mx« RI | 10178) 4.3(4) 
ALPHA [78] 2  |Clover |ro SF |9722) 

CPPACS [99] 2 Clover |m, pert. 88) 26(2) 3.4405) 
ETM [100] 2. |tmQCD | f, RI | 105(3)(9) |27.3(3)01.2) | 3.85(12)(40) 


The challenge for current and future simulations is to eliminate the remaining 
uncertainty due to quenching. Several simulations with Ne = 2 or 2 + 1 flavours 
of dynamical quarks! based on different fermionic discretizations have produced 
results for the light quark masses, which are shown in Table5.3. Despite the 
enormous progress that has been made in simulating light dynamical quarks, 
it is important to realize that systematic effects such as lattice artefacts and/or 
renormalization effects are currently not as well controlled as in the quenched 
theory. The fact that affordable lattice spacings are still relatively large implies that 
extrapolations to the continuum limit are in general longer than in the quenched 
approximation, thereby leading to larger errors. In some cases it is not even clear 
whether the leading lattice artefacts in dynamical simulations have been isolated. 
Also, the quantity Q that sets the scale must be known at least as accurately as the 
quark mass itself, and hence the determination of these observables may prove just 
as costly. Finally, dynamical quark masses are still fairly large, especially in many 
simulations using Wilson fermions, and thus the long and potentially uncontrolled 
chiral extrapolations significantly affect estimates for the isospin-averaged light 
quark mass 7A. 


5.6 Spontaneous Chiral Symmetry Breaking 


Chiral symmetry has already been mentioned in connection with the masses of the 
light quarks. Here we will extend the general framework and elaborate on effective 
descriptions of QCD at low energies, which can be treated analytically. As we 
shall see, much can be learnt via the interplay of such effective theories and lattice 
simulations of QCD. 


15 Ne = 2 usually denotes a degenerate doublet of light (u, d) quarks, while Ne = 2 + 1 denotes a 
degenerate doublet together with a heavier third flavour, i.e. the strange quark. 
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Massless QCD with Nt flavours is invariant under independent rotations of the 


left- and right-handed components of the quarks fields. If one defines the field Y as 
the vector of Nr Dirac spinors v; via 


Y= (yas... YN) (5.163) 


its left- and right-handed components are given by 


Uy = (IQP) Y, Wr:= (LNP) Y, Pe—iüty). (5.164) 


The action of the massless theory is then invariant under transformations like 
Y > V' = expli P (or. T) +iP} (og-T)) Y, (5.165) 


where cr, og are real vectors, and T denotes the generators of SU(N¢), which 
satisfy 


Fg ud -if""T^ — Tr(T^T)— 18%, (5.166) 
The above transformation can be rewritten in terms of vector and axial rotations, i.e. 
V — Yy =exp{iay-T -- ic 4-T ys) V, (5.167) 


where dy = 1 (OR -- cop) and «4A = 1 (og — er). Invariance under these 
transformation laws is what one usually means when one says that (massless) QCD 
is invariant under a global SU(N¢)L & SU(Nf)r symmetry. 

Actually, QCD has even more global symmetries, namely a U(1)y symmetry, 
which corresponds to a common rotation of all quark flavours. The conserved charge 
derived from the Noether current, which is associated with this unbroken symmetry, 
is the quark number. The conservation of the axial current associated with the 
remaining axial U(1) symmetry is, however, severely broken by an anomalous term, 
which gives rise to strong non-perturbative effects generated by instantons. Without 
going into further detail here, we refer to common textbooks. 

Returning now to SU(Nr)_ & SU(Nr)R, we note that symmetries in sub-nuclear 
physics are usually deduced from the particle spectrum. That is, symmetries man- 
ifest themselves through the occurrence of mass-degenerate (or nearly degenerate) 
particle multiplets that can be grouped according to the irreducible representations 
of the symmetry group. Indeed, for Nf = 3 one finds that the light pseudoscalar 
mesons, i.e. the pions, kaons and yn-mesons form an octet. The mass splittings 
among the members of the octet are small when viewed on typical hadronic scales, 
and arise due to the unequal, non-zero masses of the light quarks. However, if 
the pseudoscalar octet were interpreted as a manifestation of an (approximate) 
SU(3); & SU (3) a chiral symmetry, one would expect that each member of the octet 
is accompanied by a parity partner, i.e. a scalar meson, whose mass is of the same 
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order of magnitude. This is not observed in experiment, where the lightest scalar 
mesons are found to lie 600—700 MeV above the pseudoscalar octet. One therefore 
concludes that the symmetry must be spontaneously broken. The term “spontaneous 
breaking” refers to the fact that theories like QCD possess more internal symmetries 
than those that can be inferred from the particle spectrum. In general, spontaneously 
broken symmetries are not realized as symmetry transformations involving the 
physical states of the theory. In particular, the ground state, i.e. the vacuum, is not 
invariant under the transformation. As discussed in many textbooks, it is precisely 
the invariance of the vacuum under the symmetry transformation that is required 
to ensure the degeneracy of the particle spectrum. If the vacuum is not invariant, 
certain operators may acquire a non-vanishing expectation value. In fact, a sufficient 
condition for the spontaneous breaking of the physical SU(3); & SU (3)r chiral 
symmetry is fulfilled if the expectation value of the scalar density, YY, is non-zero, 
ie. 


(ww) = (uu + dd + 55) £ 0. (5.168) 


Furthermore, according to Goldstone's theorem [101], the generator of each broken 
symmetry is associated with a massless particle. Since the masses of the members 
of the pseudoscalar octet are rather small in comparison with the proton mass, they 
are identified as the Goldstone bosons of the spontaneously broken chiral symmetry. 

Spontaneous chiral symmetry breaking is an entirely non-perturbative phe- 
nomenon. The task is then to explore the breaking mechanism and compute the 
value of the quark condensate (V Vy). As shall be outlined below, this can be achieved 
through the interplay of lattice simulations and effective low-energy descriptions of 
QCD. 


5.6.1 Chiral Perturbation Theory 


Chiral Perturbation Theory (ChPT) has already been mentioned in connection with 
extrapolations of lattice data to the physical values of the up- and down-quark 
masses, and also in the context of lattice determinations of the strange quark mass. 
Here we present a brief introduction into the general formalism. More thorough 
reviews can be found in Refs. [102, 103]. 

Chiral Perturbation Theory is an effective theory, based on a systematic expan- 
sion of the low-energy dynamics of QCD in powers of the 4-momentum and the 
quark mass about the chiral limit [89, 90], i.e. 


Lor = LR +L 4..., (5.169) 


where the superscripts label the order of the expansion in powers of p. In contrast to 
QCD, the basic degrees of freedom which appear in Leff are the Goldstone bosons, 
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rather than the fundamental quarks and gluons. ChPT is parameterized in terms of a 
set of empirical couplings, usually called “low-energy constants" (LECs). At lowest 
order, the effective chiral Lagrangian (in Euclidean space-time) reads 


(2) + 
LG} = FG {Tr (8.U7A,U) — BoTr (MU - U)) }, (5.170) 
where M = diag(m,, mg, ms) is the quark mass matrix, and U(x) collects the 
Goldstone boson fields, i.e. 


i 8 m’ + NL J/2n*  J2K* 
U(x) = exp (rw): i62) | VAT -n° + Jen AK? 
2 VIK- JIR? -2n 


The A^'s denote the Gell-Mann matrices which are normalized as Tr (A4 A?) = 28%. 
The LECs at leading order are Bo and Fo, where the latter corresponds to the pion 
decay constant in the chiral limit. 1° The expression for T ,ie.the interaction terms 
at next-to-leading order in the chiral expansion, contains 12 additional interaction 
terms, multiplied by the LECs L1,..., Lio, Hi, H2. The values of the LECs are 
usually determined by matching the expressions of ChPT for physical observables 
to experimental data. However, it turns out that the complete set of LECs cannot 
be obtained in this way. Rather, in order to fix the values of some LECs, one must 
resort to additional theoretical assumptions. One particular example is the value of 
Bo, which appears in the chiral expansion of the pion mass at lowest order (see also 
Eq. (5.80)): 


m2 = Bo(m, + ma). (5.172) 


From this expressions it is clear that Bo can only be determined using m; as input 
if the physical values of the quark masses are known in the first place. By the same 
token, the value of m = in, + mq) can only be inferred if an estimate for Bo 
is available. However, the a priori unknown parameter Bo drops out in suitably 
chosen ratios of m2, mg, .... This explains why ChPT can be used to predict the 
ratios of the light quark masses but fails to provide an absolute mass scale. Another 
reason why the complete set LECs cannot be determined from chiral symmetry 
considerations alone is the fact that the effective Lagrangian beyond leading order is 
invariant under a symmetry transformation which involves the LECs and the mass 
matrix M, but which is absent in QCD. This is the so-called “Kaplan-Manohar 
ambiguity" [104]. At this point itis clear that lattice simulations of QCD can provide 
valuable input for the determination of LECs. For instance, since the values of the 


'6We use capital symbols for decay constants whenever we refer to a normalization in which 
~ 93 MeV. 
T 
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quark masses are input parameters in the simulations, lattice QCD allows to map out 
the quark mass dependence of the masses of Goldstone bosons and thus determine 
the LEC Bo. We shall see below that Bo is related to the quark condensate (V) 
which can be considered as the order parameter for spontaneous chiral symmetry 
breaking. Furthermore, as we have already discussed in Sect. 5.5.6, absolute values 
of quark masses are accessible via lattice QCD. 

We end our brief introduction to ChPT with the derivation of a few relations 
which will be useful for our discussion of chiral symmetry breaking below. In 
particular, we shall derive the leading-order mass formulae such as Eq. (5.80) and 
establish a link between the quark condensate and Bo. To this end we expand the 
field U in the chiral Lagrangian LQ in powers of the Goldstone boson fields. 
Assuming exact isospin symmetry, m, = mq, one finds at lowest order in ġa: 


8 
1 
(2) X } 
Lott = 2 9 a9, 0a + "P 


a=1 


3 7 
1 1. 
+5 (mu + ma) BoD) ba +50 + m)Bo oa 


a=1 a=4 


UM 
tah + 2ms) Body + +--+ (5.173) 


After identifying $1, 62, $3 with the pions, $4, ..., $7 with the kaons, and ¢g = n, 
one derives the leading-order relations between the quark masses and the masses of 
the Goldstone bosons, viz. 


m? = 2Bom, my = Bo(h-- m,), m? = $ Bo + 2m;). (5.174) 
Thus, the relation for a generic pseudoscalar Goldstone boson made up of quarks 
with masses mı and m» is precisely what was already shown in Eq. (5.80). We note 
that from Eq. (5.174) one easily derives the Gell-Mann-Okubo mass relation, i.e. 


3m? + m2. — 4m% = 0, (5.175) 


which is satisfied experimentally within a few percent. Furthermore, Eq. (5.174) 
yields the ratio m; /m at lowest order, viz. 


2 2 2 
Ts = DUK T ~ 04, (5.176) 
m ms, 


which is already close to the estimate at next-to-leading order of m;/m = 24.4 + 
1.5 [38], quoted in Sect. 5.5.6. 

For the discussion of spontaneous symmetry breaking, it is useful to establish 
a connection between the quark condensate in QCD, (Ww), and the LECs which 
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parameterize the effective chiral Lagrangian. This link is provided by the so-called 
Gell-Mann-Oakes-Renner relation [105], which we are going to derive below. To 
this end we consider the QCD Lagrangian in the continuum: 


Lecp = -EFRO p(x) + 5 Wx) (yu Dy + m y) v p Qo). (5.177) 
f 


The path integral is defined as 


Zaw = f Dian) DIY, y] exp |- fe Loco} (5.178) 


and the expression for the quark condensate can be formally derived by taking 
derivatives with respect to the light quark masses, i.e. 


8 In Zoco 2 = = z 
2 EE — (uu + dd + $5]l,, o = (ww). (5.179) 


f=u,d,s m —0 


What is the analogue of this expression in the effective chiral theory? To answer this 


question one takes the lowest-order chiral Lagrangian of Eq. (5.170) and defines the 
corresponding path integral!" 


ZchPT = f wee |- [o's e (5.180) 


Since pu contains the quark mass matrix one can consider similar derivatives, i.e. 


FOB 
SInZcuery — pen Y i (tmu v). EB us 
f=u,d,s ðm p my-0 2 f=u,d,s m, di i omo 
(5.181) 
and comparison with Eq. (5.179) yields 
1 " 
E (uu + dd + 5s) = X = F Bo. (5.182) 


In other words, the quark condensate is related to the slope parameter in the lowest- 
order mass formulae and the pion decay constant in the chiral limit, Fo. This result 
is known as the Gell-Mann-Oakes- Renner relation. 


'7Tt should be obvious that the field U must not be confused with the link variable considered in 
previous sections. 
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5.6.2 Lattice Calculations of the Quark Condensate 


The Gell-Mann-Oakes-Renner relation is the starting point for many lattice deter- 
minations of the quark condensate. For a generic pseudoscalar meson consisting of 
a mass-degenerate quark and antiquark, i.e. mı = m2 = m, the LEC X is given by 


2 F2 
y= jin i (5.183) 


m—>0 2m 


The technical drawback of this straightforward approach is that the chiral limit in 
the above expression is difficult to take in practice, as we have mentioned several 
times already. In the quenched approximation the situation is even worse: due to the 
appearance of quenched chiral logarithms (c.f. Eq. (5.82)) the ratio Mpg /m becomes 
singular at vanishing quark mass, and hence the chiral limit does not exist. Since 
the quenched approximation is being abandoned, this issue will gradually become 
irrelevant. 

However, a more serious obstacle remains in the case of dynamical simulations 
with Wilson fermions: since this particular type of regularization breaks chiral 
symmetry explicitly, the matching of simulation data at non-zero lattice spacing to 
the expressions of ChPT is—strictly speaking—not permitted. Matching is certainly 
justified if a fermionic discretization is employed which preserves chiral symmetry, 
such as overlap or domain wall fermions, or if results obtained using Wilson 
fermions are extrapolated to the continuum limit before a comparison to ChPT is 
performed. 

A complementary approach for determining the condensate on the lattice is based 
on the Banks—Casher relation [106]. It provides a link between the LEC X and the 
spectral properties of the Dirac operator, viz. 


= lim lim lim Few), (5.184) 


A—0m—0V—oo V 


where V is the space-time volume. The spectral density o (A) is defined as follows: 
Let D denote the massless Dirac operator in the continuum, satisfying (ys, D} = 0. 
Its eigenvalue equation reads 


Dy, = iàn Yn, An ER, (5.185) 


where the eigenvalues and eigenfunctions depend on the gauge field. A suitable 
definition of the spectral density is then represented by 


pr) = M (8 — 24)). (5.186) 


n 
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where the expectation value is taken with respect to the QCD functional integral. !? 


Note that in Eq. (5.184) the ordering of limits must be obeyed. In particular, since 
the spontaneous breaking of a continuous symmetry cannot occur in finite volume, 
the limit V — oo must be taken before the chiral limit and the spectrum in the deep 
infrared are considered. 

The Banks—Casher relation provides not only a method to determine the conden- 
sate, but also suggests a mechanism how spontaneous chiral symmetry breaking 
comes about. Indeed, Eq.(5.184) implies that a non-zero value of the quark 
condensate is generated through a non-vanishing value of the spectral density in 
the deep infrared. In other words, spontaneous chiral symmetry breaking is driven 
by an accumulation of small eigenvalues. An immediate consequence of the Banks— 
Casher relation is that the level spacing AA between the small eigenvalues is given 
by 

1 T 


Hence, as V —> œ the level spacing becomes arbitrarily small. In the free theory, 
ie. in the absence of a non-trivial gauge field one finds that p(A) œ A3, which 
vanishes as A — 0. The accumulation of eigenvalues near zero with a rate predicted 
by Eq. (5.187) must therefore arise through the interaction with the gauge field. 

In order to test the Banks-Casher scenario, a possible strategy is to compute 
the spectral density and check whether it actually produces an arbitrarily dense 
spectrum near the origin. Analytic predictions for o(A) can be derived in the 
framework of effective theories of QCD at low energies, namely ChPT, as well 
as chiral Random Matrix Theory (RMT). The latter also yields predictions for the 
distributions of individual eigenvalues, in addition to the spectral density. 

Chiral Random Matrix Theory goes back to an idea of Wigner who tried to utilize 
statistical properties for the theoretical description of systems with many degrees 
of freedom and complicated dynamics, such as nuclear resonances. Rather than 
trying to model the local interactions within such a system explicitly, all possible 
interactions that are consistent with the symmetries of the theory are equally likely. 
The Hamiltonian is then approximated by a matrix whose elements are uncorrelated 
but obey a particular probability distribution. The main guiding principle for the 
RMT description of QCD is the requirement that all global symmetries must be 
respected. The massless Dirac operator can then be represented by an N x N matrix 
D with an off-diagonal block structure which is characteristic for systems with 
chiral symmetry: 


a ( 9 W\}Ny 
b- e 4 We (5.188) 


18A normalization factor of V7! must be included in Eq. (5.184) since p(A) is proportional to the 
volume. 
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As illustrated by the above expression, the matrix W is, in general, rectangular with 
N, rows and N_ columns, such that N = N} + N_. For Ny Æ N- the matrix Db 
has |N.. — N_| zero modes, and the index v = N, — N_ may be identified with the 
topological charge in QCD. With this definition, D is anti-hermitian and has purely 
imaginary eigenvalues which come in complex conjugate pairs: 


Don = inn, Un ER. (5.189) 


One can define the system’s partition function in a sector of fixed topological charge 
v via 


x 1 + 
Z, = J D[W] det (D + m) e- 2 Tr QW). (5.190) 


where Nf is—as usual—the number of dynamical quark flavours. It makes sense to 
identify the matrix size N with the physical volume V of the theory (up to some 
proportionality constant). 

In order to study the spectral properties of D in the deep infrared, it is useful to 
rescale the eigenvalues by the system size 


Z=UnN, N«V (5.191) 
since, according to Eq. (5.187), the level spacing of the scaled eigenvalues z is of 


order one. The so-called microscopic spectral density in the sector of topological 
charge v is then defined as 


pre) im. ) 06 is Ns, (5.192) 


where the expectation value (- - - ),, is taken with respect to the partition function Zy. 


An explicit expression for oO (z) in terms of Bessel functions has been worked out 
by Verbaarschot and Zahed [107] 


p™ (z) = ; [^or — Ine+v41(Z) Ini C] (5.193) 


The microscopic spectral density is the convolution of the distribution functions p” 
of the individual scaled eigenvalues, i.e. 


oo oo 
Po Po f aoz (5.194) 
k=1 
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Fig. 5.16 RMT predictions 5 r TE aa mr 
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Chiral RMT yields predictions for these distributions. For instance, for the lowest 
eigenvalue in the sector with v = 0 one obtains for Nr = 0 


1 
pz) = ee (5.195) 


For further illustration the microscopic spectral density and the distribution func- 
tions for a few of the lowest eigenvalues are plotted in Fig.5.16. The result for 
ny) (z) indicates that an accumulation of small eigenvalues does indeed take place. 
Since one considers the simultaneous limits u — 0 and N — oo for fixed z, a 
non-zero value of pP) for finite z signals that the spectrum is packed more and 
more densely near the origin. 

Can the predictions of RMT be verified from first principles in simulations of 
lattice QCD? The answer is ‘yes’, provided one considers a particular kinematical 
situation, commonly referred to as the “e-regime” of QCD. It is based on the 
formulation of QCD in a large but finite volume of spatial size L and for arbitrarily 
small quark mass. The Compton wavelength of the pion then exceeds the spatial 
size, and thus the e-regime is characterized by 


ML « 1, F,L> 1. (5.196) 
In this particular situation the path integral of the theory is dominated by zero 
momentum modes. In a symmetric finite box with volume V = L^, the minimum 


non-zero momentum is given by Pmin X 1/L. Let us recall the expression for the 
lowest-order effective chiral Lagrangian, i.e. 


La inl 1Tr (Ə Uta LU) — mE Tr (e?/ "rU + h.c.) |, (5.197) 
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where we have included the vacuum angle 0 and assumed that M = m1. If the 
quark mass rn is tuned so that 


mE K Fi phin ~ FE/LP, (5.198) 


the statistical weight of fields with 0,,U # 0 will be strongly suppressed in the path 
integral. In other words, the mass term will dominate over the kinetic term, except 
for fields U with à,U = 0. Since 2m Z/ Fj = mpg, the conditions in Eq. (5.196), 
which define the kinematical situation of the e-regime, are equivalent to 


mV <1. (5.199) 


The zero-momentum part can be represented by a constant SU(3) matrix Up such 
that 


U(x) = Uge”) Fo, Ug € SU(3), (5.200) 


where the field € incorporates the fluctuations about the zero momentum mode. 
According to Leutwyler and Smilga [108], the path integral of the theory in 
topological sector v can be written in the form 


ZO = f D[Uo] (det Ug)" exp (m Z V ReTr Ug) . (5.201) 


After this somewhat lengthy preparatory discussion, the connection between QCD 
in the e-regime and chiral RMT can finally be established. An important result 
derived by Shuryak and Verbaarschot [109] states that the path integral Z (0) can be 
mapped exactly onto the partition function Z, of RMT. One therefore expects that 
the low-lying eigenvalues of QCD in the e-regime are distributed in the same way 
as those in RMT. By computing the former in a lattice simulation and performing 
a comparison to the analytically known distributions in RMT, one may verify the 
Banks-Casher scenario of spontaneous chiral symmetry breaking. 

The Neuberger-Dirac operator Dy of Eq. (5.47) is ideally suited for this task. 
Since it satisfies the Ginsparg-Wilson relation, chiral symmetry is preserved at the 
level of the discretized theory. Furthermore, Dy can be shown to satisfy an exact 
index theorem, so that it sustains |v| exact zero modes on gauge configurations 
with topological charge v. This allows for an unambiguous identification of 
topological sectors to which the path integral Z ©) is restricted [110]. Therefore, the 
investigation of spontaneous chiral symmetry breaking is a prime example where it 
is absolutely vital that the lattice-regularized theory obeys the same symmetries that 
are present in the continuum. 

Before we proceed we must elucidate the relation of the spectra of the random 
matrix D and the N euberger-Dirac operator. While the eigenvalues of D are purely 
imaginary, the operator Dy is unitary, and hence its eigenvalues lie on a circle with 
radius 1/a in the complex plane, centered around the point 1/a on the real axis. 
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Fig. 5.17 Comparison of 
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Thus, if y denotes an eigenvalues of Dy, it can be parameterized as 


a 


(5.202) 


1 . 
= — 1 = ip š a= . 
4 zl e ) ý l+s 
Since the radius of the circle diverges in the continuum limit, the low-lying part of 
the spectrum satisfies |y| < 1/a, and hence Rey ~ 0. One can then identify an 
eigenvalue u of D with Imy, i.e. 


1 
Lh < Iimyzciy|- = [2(1 — cos @)]. (5.203) 


A simple but effective check of the RMT description of the low-lying spectrum can 
be performed by comparing ratios of scaled eigenvalues. The combination |yg| X V 
of the kth eigenvalue in QCD corresponds to ju, N in RMT. If the low-lying spectra 
in the two theories indeed coincide one expects the following equalities in a given 
topological sector v 


(vey ! (kv - [ (v) 14 (v) 
= = d dzz py (2). 5.204 
Vi uc dy ER SOR TO (5.204) 


While the ratio (|yx|)v/(lyjl)v is determined in the simulation, the two integrals on 
the right-hand side can be evaluated analytically for the first few eigenvalues. !? 

In Refs. [111, 112] ratios for some of the lowest eigenvalues have been computed 
in the quenched approximation. The results from [111] are shown in Fig. 5.17 for a 
box size L = 1.49 fm. The agreement between lattice results and RMT is excellent. 
By contrast, a smaller box size of about 1fm yields significant discrepancies 
between QCD and RMT, which can be as large as 10 standard deviations. This 
is a reflection of the fact that the large volume limit must be taken before the RMT 


I?The expressions for the distributions pi (z) become rapidly more complicated as k increases, 
so that one may have to resort to numerical evaluations of the integrals. 
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behaviour sets in. Similar findings have been reported for QCD with Nf = 2 flavours 
of dynamical overlap quarks [113]. 

The confirmation of the RMT prediction for the distribution of the low-lying 
eigenvalues supports the Banks—Casher scenario of spontaneous chiral symmetry 
breaking. In a subsequent step one may therefore extract the LEC » via the relation 


(lye EV = (uN) - deeply (5.205) 


If X is identified with the expectation value of the scalar density, as suggested 
by the effective low-energy description of QCD, it must be related to a particular 
continuum scheme, like the MS-scheme of dimensional regularization. If the reg- 
ularization prescription obeys chiral symmetry, the corresponding renormalization 
factor, Zs, satisfies 


Zs = Zp = l/Zm. (5.206) 


where Zq relates the bare quark mass to the chosen continuum scheme (for instance, 
MS). Provided that Zs, or equivalently, Zm has been computed for a range of bare 
couplings, the lattice estimates for X can be used to determine the renormalized 
condensate in units of some scale, e.g. 


rà Dye) = Zs (go, au)r E + O(a’). (5.207) 


For the Neuberger-Dirac operator, Zs has been computed non-perturbatively in the 
quenched approximation [114], employing the technique outlined in Ref. [115]. The 
resulting values for Zs could then be combined with the results for X extracted from 
the matching to RMT from [111]. A subsequent extrapolation to vanishing lattice 
spacing yields the results for the renormalized condensate in the continuum limit: 


Xy GeV) = (285+9MeV)?, (scale set by fk). (5.208) 


The quoted error represents the total uncertainty arising from statistics, the uncer- 
tainty in the renormalization factor, and the continuum extrapolation. If the nucleon 
mass is used to set the scale the central value drops to 261 MeV, as a consequence 
of the scale ambiguity encountered in the quenched approximation. We stress once 
more that the chiral condensate is ill-defined in the quenched theory, and thus great 
care must be taken when the results are interpreted in the context of the full theory. 
Nevertheless, it is encouraging that for Ne = 2 flavours of dynamical quarks, a 
similar calculation [113] finds Xxx (2 GeV) = (25147411 MeV)? ata ~ 0.11 fm, 
in good agreement with the quenched result, given the inherent ambiguities and 
inconsistencies of the latter. 

Lattice results for the condensate have been reported by many other authors (e.g. 
[116-125], employing a variety of approaches. Although the various calculations 
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are subject to different systematics, the overall picture is rather consistent, with val- 
ues for the condensate centering around (250 MeV)?. As for many other quantities, 
the influence of lattice artefacts and renormalization effects must be studied in more 
detail, especially in the case of fully dynamical calculations. It is also important 
to mention that analytic non-perturbative approaches to the strong interaction, such 
QCD sum rules, also give broadly consistent results with lattice simulations within 
the quoted uncertainties (see e.g. [126—128] and references therein). This completes 
the consistent picture of chiral symmetry and its spontaneous breaking in QCD. 


5.7] Hadronic Weak Matrix Elements 


The experimental programme at the B-factories BaBar and Belle, as well as many 
other experiments at high-energy colliders, such as the Tevatron and LEP, have 
greatly enhanced the accuracy of many observables related to flavour physics and the 
Cabibbo-Kobayashi-Maskawa (CKM) matrix. The main motivation for studying 
flavour physics is to gain a proper understanding of CP violation and, in turn, 
the matter-antimatter asymmetry which is apparently manifest in the universe. CP 
violation is incorporated into the Standard Model via a complex phase in the CKM 
matrix, and therefore a precise knowledge of its elements is required to decide 
whether or not additional sources of CP violation must be considered. 

In order to make these statements more precise we recall some basic definitions. 
As is well known, the CKM matrix Vckw relates flavour to mass eigenstates. For 
flavour-changing charged current transitions between up- and down-type quarks this 
implies that, in addition to the dominant transitions like u <> d, c <> s and t < b, 
there are further transitions of lesser strength. The CKM matrix is therefore expected 
to possess a hierarchical structure, with the diagonal elements V,;, Ves and Vip 
being of order one. An approximate parameterization that takes this into account 
is due to Wolfenstein [129]. By expanding Vcxm in powers of the Cabibbo-angle 
|Vus| =A x 0.22 one obtains 


j 2272 A AA (p i 
Vua Vus Vub / (o im 
VckM = | Vea Ves Va | > | —X— iA?35q 1—1°/2 AX? ; 
Via Vis Vib AX = 5- iñ) AA? ay 1 
(5.209) 


with the remaining parameters A, p and 7 of order one.” In the standard model, 
VckM is unitary, and, provided that one can determine its elements with sufficient 
precision, any deviation from unitarity would be a signature of “new physics". 


20The relation of rescaled parameter p to p is given by p = p(1 — A2/2 + O(A4)), and a similar 
relation holds for 7 and n. 
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Fig. 5.18 Constraints on the 
apex of the unitarity 
triangle [130] 
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Unitarity gives rise to relations such as 
Vud Vip + Vea Vip + Via Vi = 0. (5.210) 


which can be represented by a triangle. The strategy that has been adopted in order 
to search for hints of new physics, is to use experimental and theoretical input to 
over-constrain the unitarity relations like those in Eq. (5.210). The current status is 
depicted in Fig. 5.18, where the unitarity triangle is plotted in the (ø, 7)-plane [130]. 

The experimentally measured quantities, i.e. the mass differences AM;, AMg 
and e, the latter of which parameterizes indirect CP violation in the kaon system, 
serve to constrain the apex of the unitarity triangle. They are proportional to the 
relevant CKM matrix elements, i.e. 


G2M2 in : 2 AM, fà 8B, mg, |Vis|2 
AMa = np s(——) fg Ês |Via V|", LIA 

67 Mw AMa — fiBg ms Vil 
ek « By Im(V,4 VŽ), (5.211) 


where Gr is the Fermi constant, and Mw, m; denote the masses of the W-boson and 
top quark, respectively. The proportionality factors in the above expressions involve 
the leptonic B-meson decay constants fg and fp,, as well as the B-parameters Bs, 
Bs, and Bx, which in turn parameterize the transition amplitudes for p= B°, 
B? — B®, and K? — K? mixing. While the decay constants are difficult to measure 
with sufficient accuracy, due to the fact that the leptonic decay rates are suppressed, 
the B-parameters are not at all accessible in experiment. One must therefore resort 
to theoretical estimates of these quantities. Since non-perturbative effects must 
inevitably be included, lattice simulations of QCD are ideally suited for this task. 
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Lattice calculations of weak hadronic matrix elements is a major activity within 
the lattice community, and a thorough coverage of all aspects would easily fill 
an entire chapter. We shall therefore concentrate on some of the most important 
quantities, and point out the main conceptual issues. Itis strongly recommended that 
the reader consult the regular reviews of the topic at the annual lattice conferences, 
e.g. [131-135]. 


5.7.1 Weak Matrix Elements in the Kaon Sector 


In the kaon sector, K? — K? mixing is one of the most important processes. The 
B-parameter Bx parameterizes the non-perturbative contribution to indirect CP 
violation. It is defined by the ratio of the relevant operator matrix element to its 
value in the so-called “vacuum saturation approximation": 


(K° |o45=2(u)| K?) 


Bx(u) = 
3 fkmk 


(5.212) 


Here, u denotes the renormalization scale at which the AS = 2 four-quark operator 
Q^5—?. defined by 


os? = [sy,(1 — ys)d] [syy — ys)d] = Ovv+aa — Ovazav, — (5:213) 


is considered. The relation between ey and the CKM matrix elements is provided 
by the RG-invariant B-parameter Bx. In NLO perturbation theory Bx is related to 
Bx (1) via 


» faye rr”? {= [boi — bivo 
Bk = ( Am ) 1+ gh) EE Bx(w), (5.214) 


where yo, yı denote the coefficients in the perturbative expansion of the anomalous 
dimension of Q45=?. Since QCD is parity-conserving, the physically relevant 
operator in the above expression is the parity-even combination Oyy+aa. The 
typical left-handed chiral structure of this operator, which is characteristic for 
weak transitions, poses a problem for lattice calculations if Wilson fermions are 
employed. In this case the discretization breaks chiral symmetry explicitly, and 
thus Ovyy+Aa mixes under renormalization with operators involving the opposite 
chirality. Therefore, the general renormalization pattern is 


4 
OS y LAN QD = Z(g0, an) OT LAA sr 2 sore] (5.215) 
i=l 
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Thus, in order to determine the physical matrix element, one must not only 
determine the overall renormalization factor Z, but also the mixing coefficients 
Aj. Several techniques have been developed [136-138] to address this problem, 
which is merely an inconvenience rather than a serious obstacle. In a formulation 
based on staggered fermions the problem is absent, since the remnant U(1) & U(1) 
symmetry protects the operator from mixing with other chiralities. However, a 
drawback of the staggered formulation is the broken flavour (“taste”) symmetry, 
which may lead to significant complications [139]. Fermionic discretizations based 
on the Ginsparg-Wilson relation, such as domain wall or overlap fermions do not 
suffer from the mixing problem, whilst preserving all flavour symmetries. Finally, 
the mixing problem can also be circumvented for Wilson-like discretizations in the 
context of twisted-mass QCD [140, 141]. With the help of a suitably chosen flavour 
rotation (see Eq. (5.51)), the matrix element of Ovy+aa in QCD can be mapped 
exactly onto that of the parity-odd operator Oya+ay in the chirally twisted theory, 
viz. 

(° GaN Em B i(k° [Raal KU co: ae) 
It has been shown that Oya+ay renormalizes purely multiplicatively [142], i.e. 
all mixing coefficients vanish. The overall multiplicative, scale-dependent renor- 
malization factor of Oya+ay which yields the physical matrix element has been 
determined non-perturbatively [143], using the finite-size scaling procedure based 
on the Schródinger functional formalism described in Sect. 5.5.2. 

We now give a summary of the current status of Bx. Here, the calculation 
by the JLQCD Collaboration [154], based on staggered quarks in the quenched 
approximation, has served as a benchmark result for a long time. Their result, 
for which the perturbatively renormalized matrix element was extrapolated to the 
continuum limit, has since been confirmed by many other calculations employing 
different fermionic discretizations and different renormalization techniques. These 
include domain wall [148, 149] and overlap quarks [150, 151], as well as the 
Wilson formulation [153, 155]. Moreover, a calculation employing twisted mass 
QCD has been completed [152], which includes non-perturbative renormalization 
and a thorough investigation of the continuum limit. 

Recently, results for Bk from simulations with dynamical quarks have become 
available, both for Ne = 2 [146, 147] and Ne = 2 + 1 flavours [144, 145]. A 
compilation of quenched and unquenched results is shown in Fig. 5.19. Although 
the figure suggests a trend in the data which points to slightly lower estimates for 
By if dynamical quarks are switched on (see Fig. 5.19), the quoted uncertainties are 
still too large to point to a significant deviation. In particular, a systematic study 
of the continuum limit in the unquenched case is not yet available. It is interesting 
to compare the results for Bk to the non-lattice determination in Ref. [130]. Here, 
the determinations of the angles of the unitarity triangle from experimental data 
in conjunction with direct measurements of AM4, AM; and ey allow to fit the 
values of several of the quantities in Eq. (5.211), which incorporate the hadronic 
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Fig. 5.19 Recent lattice results for the RGI kaon B-parameter By. From top to bottom, the 
plotted values are taken from Refs. [144—154]. Dotted error bars (where shown) indicate the 
quoted systematic error. The labels include information on the fermionic discretization and the 
intermediate renormalization scheme, if non-perturbative renormalization was used. We also 
indicate whether or not the results have been extrapolated to the continuum limit. The vertical 
lines represent the (non-lattice) result from [130], with the quoted uncertainty (see text) 


uncertainties. In this way one obtains pron = lattice _ 0,94 + 0.17, which is shown 
as the vertical band in Fig. 5.19. Clearly, within the rather large error margins, this 
result is compatible with all lattice determinations, quenched or unquenched. 


First Row Unitarity and the Value of |V,,| In addition to Eq. (5.210), the unitarity 
of the CKM matrix implies many other constraints on its elements, such as those 
which appear in the first row: 


[Vua]? + |Vas 2 + Vul? = 1. (5.217) 


Owing to the smallness of |V,;|, i.e. |Vusl? c 2-107, the direct verification of first 
row unitarity with the current experimental and theoretical accuracy rests on the 
precise knowledge of |V,,a| and |V,s|. The value of |V,4| can be determined with 
high accuracy from super-allowed nuclear f-decays (0* — O" transitions), and in 
the current edition of the particle data book the best estimate is quoted as [61] 


|Vua| = 0.97377 + 0.00027. (5.218) 


5 QCD on the Lattice 219 


The value of | V,,| can be extracted from the decay rate of K 3 transitions, i.e. 


2 
, 


Gim% 2| pK 
T(K > rly) « IVasl |^ 7 (0) (5.219) 


19273 


where f. a T is one of the two form factors which parameterize the hadronic matrix 
element for semi-leptonic K — 7r£v; transitions, i.e. 


(n Gaz) (Evu) O)| K (PK)) = £87 (aD (pk + Pade + £A" (x — pu. 
qu = (PK — Pr) p- (5.220) 


In order to arrive at a precise estimate for |V,,5|, f. Ps m (q?) must be determined with 
an accuracy at the level of 1%, since the decay rate and hence the combination 
[Vos T n ]? can be measured rather precisely. The form factor f rag admits a 
chiral expansion; At zero momentum transfer it reads 


fk" O=1l+ftfat.... (5.221) 


While the leading chiral correction, f? = —0.023, has been computed long ago 
[156], knowledge on f4 and the higher corrections is still fairly limited. The strategy 
pursued in lattice calculations [157] is based on computing the quantity 


Af = fO- (+ fa), (5.222) 


which is a measure of the contributions beyond leading order. An old phenomeno- 
logical estimate by Leutwyler and Roos [158] yields the value Af = —0.016(8). 
It is clearly desirable to check this result and ultimately replace it by a model- 
independent estimate based on QCD. 

Semi-leptonic form factors can be determined in lattice simulations by comput- 
ing suitable three-point correlation functions, in which the initial and final hadronic 
states are projected onto non-vanishing momentum. The main issues that must be 
addressed in order to judge the accuracy of the form factor determination are listed 
in the following: 


* The dependence of the form factors on the momentum transfer q? must be 
modelled, in order to interpolate their values to q? = 0. Typical anséitze 
for the interpolation include linear or quadratic functions of g?, as well as 
formulae based on pole dominance [159]. The freedom of choosing a particular 
ansatz introduces a certain ambiguity, since different model functions yield 
slightly different results. Via the introduction of so-called twisted boundary 
conditions [159—164], the q? resolution of form factors can be significantly 
improved; 

* As for all quantities involving pions, a chiral extrapolation of lattice results 
must be performed. Clearly, in order to obtain f hag (0) and hence Af with 
small controlled errors, a reliable chiral extrapolation is perhaps the single most 
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Table 5.4 Recently published lattice results for the quantity Af 


Collaboration Ng Action | f X 7 (0) Af gi min [MeV] 
Becirevié et al. [157] 0 Clover |0.960(5)(6) | —0.017(5)(7) | 490 

RBC [165] 2 DWF 0.968(9)(6) | —0.009(9)(6) |490 
UKQCD/RBC [166] 2+1 |DWF 0.964(5) —0.013(5) 330 
Leutwyler and Roos [158] A. JI 0.961(8) —0.016(8) A. 

Bijnens and Talavera [167] JL A. 0.976(10) —0.001(10) ih 

Jamin et al. [168] l A. A. 0.974(11) —0.003(11) A. 
Cirigliano et al. [169] ds A. 0.984(12) 0.007(12) J. 


The minimum value of the pion mass used in the simulations is listed in the right-most column. 
The lower part of the table contains analytical estimates 


important issue. Thus, the ability to simulate as deeply as possible in the chiral 
regime will be decisive for the final accuracy; 

* Other systematic uncertainties include control over lattice artefacts, which is 
closely related to the renormalization of local operators, such as the vector 
current, which appears in Eq. (5.220). If chiral symmetry is broken explicitly, 
the (local) vector current is not conserved, and in order to guarantee a smooth 
approach to the continuum limit, its renormalization factor, Zy, must be included. 
However, in all recent simulations the form factor has been extracted from 
suitably chosen ratios in which Zy drops out. 


A compilation of recent results for the form factor f ry ? (0) and the quantity Af 
are presented in Table 5.4, where they are compared to analytical estimates. The 
agreement with the old result by Leutwyler and Roos is quite striking. Despite a 
tendency among the more recent analytical calculations to produce slightly larger 
estimates for Af, all results are in good agreement within the quoted uncertainties. 

An alternative method to determine |V,,| from experimental data was proposed 
by Marciano [170]. Instead of considering semi-leptonic decays, it is based on the 
leptonic decay rates, i.e. 


PK > u) Wus? fern 
Tire (y)) Val? f2m, 


(5.223) 


Hence, the task is to provide an input value for the ratio of decay constants, fx /fr. 
This quantity is well-suited for lattice calculations in several respects: first, ratios 
of quantities can be computed with high statistical accuracy, owing to the fact 
that the fluctuations in the numerator and denominator are correlated. Second, 
the renormalization factor of the axial current, Z4, drops out in the ratio fk/fz. 
However, since the quantity of interest involves a chiral extrapolation, the same 
caveats as in the case of the pion form factor, apply in this case. In particular, 
it is mandatory to go as close as possible to the physical mass of the pion. The 
quenched approximation is clearly of very limited value in this context, since the 
chiral behaviour and hence the actual value of fx/f; may strongly depend on the 
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Table 5.5 Recently published results for fx /fz in lattice QCD with dynamical quarks 


Collaboration Ne Action FS / fa m min [MeV] 
CP-PACS [52] 2 Clover 1.19(3) 500 
JLQCD [172] 2 Clover 1.14811)(13)G) 500 
ETM [100] 2 tmQCD 1.227(9)(24) 290 
MILC [173] 2+1 Stagg. 1.208(2)(,1) 290 
NPLQCD [174] 2+1 Stagg./DWF 1.218001) 290 
RBC/UKQCD [175] 241 DWF 1.242) 330 
HPQCD [176] - 2+1 Stagg. 1.189(7) 250 


number of active sea quarks. Furthermore, it is known that in the continuum limit of 
the quenched approximation the value fx / fr is underestimated by about 10% [171]. 

Recent results for fk/fr in lattice QCD with dynamical quarks are listed in 
Table 5.5. A caveat that applies to all such compilations is that systematic errors 
are not estimated in a uniform manner. For instance, none of the listed results (with 
the exception of [52]) is based on a systematic scaling study aimed at separating 
cutoff effects from the actual mass dependence, although the influence of lattice 
artefacts has been included in some error estimates by including cutoff effects into 
a generalized chiral fit. Moreover, not all of the listed values of fk/f; include 
finite-volume corrections, which can be computed in ChPT and incorporated into 
the ansatz for the chiral fit [177, 178]. Despite these caveats it appears, though, that 
the estimates for fx/f; based on fits including pion masses well below 500 MeV 
are compatible with each other. 


5.7.2 Weak Matrix Elements in the Heavy Quark Sector 


The main obstacle for calculations of weak matrix elements involving heavy quarks, 
and in particular the b-quark, is that one is faced with a multi-scale problem. In 
Sect. 5.2.5 we have already discussed systematic effects in lattice calculations that 
arise from finite-size effects and lattice artefacts. Translating the relations in (5.79) 
directly to the b-quark sector, one finds that the following inequalities cannot be 
satisfied simultaneously, at least not with the currently available computer power: 


amp Xl, mgL>1, Lja < 50. (5.224) 


Violation of the first relation implies the presence of large lattice artefacts, the 
second inequality must be satisfied if one wants to avoid uncontrolled finite-volume 
effects, and the third is dictated by memory capacities of current computers. With 
a b-quark mass of mp ^ 4GeV and typical inverse lattice spacings of a~! < 
4.5 GeV, it is evident that the b-quark cannot be studied directly, since its Compton 
wavelength is smaller or of the same order of magnitude than the lattice spacing 


Itself. 
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Several strategies to deal with this problem have been applied over many years, 
among them the "static approximation" [179], the non-relativistic formulation 
(NRQCD) [180], the so-called “Fermilab-approach” [181] and finite-size scaling 
techniques [182, 183]. 

Since the charm quark is lighter than the b-quark by roughly a factor three, one 
may attempt to treat charm as a fully relativistic, propagating quark in simulations. 
Still, one can incur large lattice artefacts in this way, and a careful extrapolation 
to the continuum limit is then required. However, such an extrapolation may be 
spoilt if the leading lattice artefacts cannot be isolated in the results, due to the 
relatively large mass of the charm quark. Still, if one has reason to trust the results 
obtained for relativistic charm quarks, one may extrapolate them to the mass of b- 
quark, which is yet another way of circumventing the problem that the b-quark is 
too heavy to be treated relativistically. Typically, the ansatz for the extrapolation of 
a particular quantity to the mass of the b-quark is motivated by its expected quark 
mass dependence in Heavy Quark Effective Theory (HQET). 

In the static approximation the b-quark is assumed to be infinitely heavy [179]. 
In this formalism it is convenient to represent the b-quark by a pair of spinors, 
(Wn, Wz), which propagate forward and backward in time, respectively, and which 
satisfy 


Po — 44. P-yp— 4; P= 30 y). (5.225) 


While the field W, annihilates a heavy quark, v; creates a heavy antiquark. The 
dynamics of these fields in the discretized version of the theory is described by the 
Eichten-Hill action [184] 


set at [emm]. £58 WOVE), L = gaVo), 
l (5.226) 


where Vo, V5 denote the forward and backward covariant lattice derivatives in the 
temporal direction. Although the numerical computation of the quark propagator 
based on the Eichten-Hill action is relatively “cheap”, simulation results in the static 
approximation typically suffer from relatively large statistical noise. Without going 
into detail we note that the signal-to-noise ratio can be significantly improved if one 
replaces the temporal link variables in Vo and Vj by suitably chosen generalized 
parallel transporters. A full account can be found in Ref. [185]. 

Obviously, the static approximation represents only the leading term in an 
expansion of the quark action in inverse powers of the heavy quark mass, and thus 
one expects corrections in powers of 1/mj. As described in Ref. [182], one can 
set up a formalism in which the leading corrections to physical observables can be 
systematically computed as operator insertions in correlation functions defined with 
respect to the static action $***, Again, we refrain from describing any further details 
and refer the reader to the original literature [182, 186]. 
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Higher-order corrections to the static approximation can also be incorporated into 
the theory by adding the appropriate 1/mp terms to the action itself. In this way one 
obtains a non-relativistic version of QCD (NRQCD) [180], in which the mass of the 
heavy quark is imposed as a cutoff on relativistic momentum modes, i.e. 


p^ mv K mp, (5.227) 


where v denotes the four-velocity of the heavy quark. Heuristically, the introduction 
of the cutoff is justified since the internal typical momentum modes of hadrons 
containing a heavy quark are much smaller than the mass of the latter. The loss of 
relativistic states can be compensated by adding new local interaction terms order 
by order in p/mn ~ v to Li and £3", In general, these additional interaction 
terms will generate mixing between quark and antiquark. However, by applying a 
Foldy-Wouthuysen transformation, the fields can be decoupled. At the level of the 
classical theory, the 1/71; correction to the NRQCD Lagrangian for the forward 
propagating field reads 


1 


Mh 


gies = [Jn D-Dyn 4+ Vno Byrn} : (5.228) 


and D is the vector of the covariant derivatives in the spatial directions. 

In the quantized version of the theory, the coefficients which multiply the fields 
in the above expression become dependent on the gauge coupling and must be 
appropriately tuned to guarantee the correct matching of the non-relativistic theory 
to standard QCD at order in 1/mp. Thus, the lattice-regularized version of the 1/mp 
correction reads 


Biene fory Vin + ooo Byi] ; (5.229) 


where B denotes a lattice representation of the magnetic field. The coefficients a 
and c» are formally of order 1/mp and are found to be linearly divergent in the 
lattice spacing a. Therefore, at a given order in the non-relativistic expansion of 
the action, a finite cutoff must be kept, and in this sense the effective theory is non- 
renormalizable. All this implies that in NRQCD the continuum limit, a — 0, cannot 
be taken. Instead, one must argue that lattice artefacts are small in the range of lattice 
spacings where the calculations are performed. 

Another approach can be based on the idea that the Wilson fermion action is 
suitably adapted for heavy quarks, such that the Wilson quark propagator does not 
deviate from the continuum behaviour even for quark masses am Z 1, i.e. for quark 
masses near or above the cutoff [181]. According to Ref. [181] this can be achieved 
by modifying the normalization of the quark fields (see Eq. (5.36)) in the discretized 
lattice theory, i.e. 


W(x) > V2K e" "9" y (x), V (x) > W(x) e? "9/2. / 2. (5.230) 
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where the "pole mass" amp of the Wilson propagator is given by 
amp = In(1 + am), (5.231) 


and am denotes the bare subtracted quark mass in the Wilson theory (see Eq. (5.39)). 
The factor 4/2x e^"?/? is designed to interpolate smoothly between the relativistic 
and non-relativistic regimes. As a consequence, in order to cancel the effects of large 
quark masses in hadronic matrix elements involving b-quarks, the normalization of 
quark fields is modified according to the above prescription. The so-called “Fermi- 
lab approach" to heavy quark physics on the lattice is based on the normalization 
in Eq. (5.230). Essentially it amounts to formulating an effective theory for quarks, 
whose spatial momenta are small, |ap| « 1, with mass-dependent coefficients. 
Like in the case of the static approximation, the formalism allows to take the 
continuum limit. Related approaches to the Fermilab method have been presented 
in Refs. [187, 188]. 

Finally, we briefly introduce another strategy to deal with heavy quarks on the 
lattice and the related multi-scale problem [182, 183]. Here the condition m; L > 1 
in Eq. (5.224) is sacrificed in favour of am, «& 1. In this way one is able to 
accommodate a fully relativistic b-quark at the expense of having to deal with 
strong finite-volume effects. The key observation is that the "distortion" due to 
unphysically small volumes can be computed in a series of finite-size scaling steps, 
which relate the results obtained on a sequence of lattice sizes Lo, L1, .... Like in 
the case of the non-perturbative determination of the RG running of the coupling 
and the quark mass discussed in Sect. 5.5.2, one can set up a recursive finite-size 
scaling procedure, which traces the volume dependence of observables. Here it is 
mostly sufficient to apply two or three steps in the scaling sequence. 

In the remainder of this section we shall discuss some selected results. Regarding 
the vast number of individual results, we do not attempt to provide a complete 
review of the current status of lattice calculations of weak matrix elements in the 
heavy quark sector. Regular appraisals of the progress made in studying these 
systems can be found in the rapporteur talks on the subject at the annual conferences 
on lattice field theory [132, 133, 135]. Instead we shall discuss the relation between 
CKM matrix elements and the quantities that must be computed in order to extract 
the former from experimental data without resorting to model assumptions. 


Heavy-Light Decay Constants From Eq.(5.211) and Fig. 5.18 one infers that 


the ratio € = fp, Bp, / fa Bg of decay constants and B-parameters is a key 


quantity, since it links AM,;/AMg to the ratio [Vis | /| Va? of CKM matrix 
elements. Typically, one determines decay constants and B-parameters separately, 
since the former can be easily extracted from hadronic two-point functions, while 
the latter may undergo complicated mixing patterns, depending on the fermionic 
discretization. The decay constant of, say, a B^ meson, is defined via the matrix 
element of the heavy-light axial current, i.e. 


fgmg = (0|(ayoysb)| B”). (5.232) 
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If the matrix element on the right-hand side is computed in a lattice simulation, then 
the axial current defined in the discretized theory must be matched to its counterpart 
in the continuum formulation. The details of the matching procedure depend on the 
type of fermionic discretization and the chosen treatment to represent the heavy- 
light axial current on the lattice (e.g. static approximation, NRQCD, etc.). If the 
b-quark is treated in the static approximation, the axial current has a non-vanishing 
anomalous dimension, and hence its running must be determined as well. Therefore, 
the various techniques which have been developed to compute the renormalization 
factors of local operators non-perturbatively, are of particular relevance also in the 
study of heavy-light decay constants [189]. In particular, non-perturbative estimates 
for the renormalization factor of the axial current, Za, are required to ensure a 
smooth convergence towards the continuum limit. 

We now present results for fg and fp,. From Chiral Perturbation Theory one 
expects that the bulk of the SU(3)-flavour breaking effect in & (i.e. the deviation 
of € from unity) is carried by the decay constants. The full expression at NLO for 
Js. /fp reads [190] 


DU es (mg — m2) fo(w) — 


n 
fb 


1 3 
rhy zP) + ge) = iren ; 
(5.233) 


where Ip(mps) = Mog In(m25/ 2) and f is a low-energy constant, and g? is the 
strength of the B* Bz vertex. As was pointed out by Kronfeld and Ryan [191], 
the contribution from the chiral logarithms can be sizeable, so that a naive linear 
extrapolation of lattice data from the region of the strange quark mass tends to 
underestimate fp, / fg. By contrast, the corresponding ratio Bp, / Bg is expected to 
be close to one, since the coefficient of the chiral logarithm nearly vanishes. Since 
f8,/ fa enters directly into fits to the CKM parameters, many attempts were made 
to pin down its value precisely. As in the case of f/f; discussed earlier, the main 
issue for lattice calculations is whether the quark masses employed in simulations 
are small enough to allow for a controlled chiral extrapolation based on the NLO 
formulae. The influence of the chiral logarithms has so far been detected only in 
simulations based on Nf = 24-1 flavours of rooted staggered quarks. Using NRQCD 
to treat the b-quark, the authors of [192] find 


fs 12040034001, — Ni 2241, (5.234) 


B 


where the first error is statistical, while the second is an estimate of the systematic 
uncertainty. This result awaits confirmation from simulations with sea quark masses 
as small as those used in [192], but employing different fermionic discretizations, 
both in the sea and valence quark sectors. This is of particular relevance, since the 
typical spread among the recently published results is of the same order or even 
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larger than the uncertainty quoted above. Further discussions and compilations of 
lattice data for fp, / fg can be found in [133, 193]. 

Estimates for absolute values of heavy-light decay constants are also highly 
desirable, especially since fg is hard to determine experimentally, even at the B- 
factories, since the B — tv, decay rate is suppressed. For fg, the suppression 
is even stronger, and thus the prospects for an experimental determination of this 
quantity are extremely uncertain. The main issues facing lattice calculations are the 
influence of lattice artefacts in conjunction with the renormalization of the axial 
current, and the dependence of results on the number of dynamical quark flavours. 

As an example for one of the most advanced quenched calculations for fp, 
we shall briefly discuss the result by the ALPHA collaboration [198], which also 
illustrates the interplay between various methods to treat the b-quark. In Ref. [198] 
the results obtained in the static approximation were combined with data computed 
around the mass of the charm quark. Provided that estimates for the decay constants 
in both datasets have been extrapolated to the continuum limit, a subsequent 
interpolation in the heavy quark mass yields the desired result for fg,. The ansatz 
for the interpolation is based on the expression 


ó 
fes/mps = Cps(M/ ^x) Y (1 + ŽŽ) ; (5.235) 


mps 


where fps is a generic heavy-light decay constant, y, ó are real constants, and the 
factor Cps arises from the matching between the static approximation and QCD 
with fully relativistic quarks. Thus, using the static approximation as the limiting 
case removes the systematic error due to the uncontrolled extrapolation to the mass 
of the b-quark. The resulting estimate for fp, is [198] 


fa, = 19346MeV,  Np=0. (5.236) 


Non-perturbative renormalization has been employed in both the static approxima- 
tion and the relativistic formulation. Except for the unknown systematic error due 
to quenching, the quoted error contains all uncertainties. The above result has been 
confirmed by the approach based on the finite-size scaling method [199]. 

Turning now to unquenched simulations, we compare the above value to the 
result by the HPQCD Collaboration [192], which was obtained using NRQCD for 
the b-quark, while Nf = 2 + 1 rooted staggered quarks were used as sea quarks. 
Here, the estimate for fg, results from a combination of the value for fg and the 
ratio fp,/fp already quoted in Eq. (5.234). In this way one obtains 


fg, = 259 + 32 MeV. (5.237) 
Thus, in spite of the large error, it appears that the inclusion of dynamical quark 


effects increases the estimate for heavy-light decay constants. This is also supported 
by other simulations. For instance, using their simulation results in quenched QCD 
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Fig. 5.20 Recent lattice results for fp,. From left to right the results are taken from the following 
papers. Ne = 0 : [194-207]; Ne = 2 :[203, 204, 207, 208], [209]; Nr = 2 + 1 :[192, 210]. 
The labels indicate the method used to treat the b-quark in the simulation (“ext” and "int" stand 
for extrapolations and interpolations to the mass of the b-quark, respectively. The horizontal lines 
represent the (non-lattice) result from [130], with the quoted uncertainty 


and with Nr = 2 flavour of dynamical Wilson quarks, the CP-PACS Collaboration 
find [203] 


Ls = 1.14 + 0.05. (5.238) 


The “non-lattice” determination of fg, via fits using the experimental results for the 
angles of the unitarity triangle as input [130] also point to a larger value compared 
to the quenched theory, as can be seen from the horizontal band in the compilation 
in Fig. 5.20. 

In current unquenched simulations, systematic effects such as lattice artefacts 
and the renormalization of local operators are not yet controlled at a similar 
level compared to the quenched approximation. Thus, despite the fact that these 
calculations are much more “realistic” in that they include sea quarks, the quoted 
overall uncertainties are still relatively large. 


B-Parameters Bs, and By, Following the recent experimental determination 
of the mass difference AM, at the Tevatron [211, 212], lattice determinations of 
the B-parameters Bs, and Bs, have received much attention. Although the first 
calculations date back to the 1990s, relatively few results are available, due to 
several specific technical difficulties. First, the complicated renormalization and 
mixing patterns of four-quark operators which afflict lattice calculations of the kaon 
B-parameter Bx are also encountered in the b-quark sector. Second, there is the 
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Table 5.6 Published lattice results for the B-parameter Bp, (mp) in the MS-scheme and the ratio 
Bp, / Bg, 


Collaboration Ny Action Bg, (mp) Bz, / Bg i 

Gadiyak et al. [213] 2 DWF 1.06(6)(3) static 
JLQCD [209] 2  |Clove | 0.836(27)8) |1.0171909) | NRQCD 
Gimenez et al. [214, 215] 0 Clover 0.81(5)(4) 1.01(1) | Static 
UKQCD [215, 216] 0 Clover | 0.79(4)(4) 1.02(2) | Static 
Christensen et al. [217] 0 Wilson  |0.98(4) 0.99(1)(1) Static 
SPQ&R [218, 219] 0 Clover 0.87(4)) 0.99(2) rel /ext. 
UKQCD[195] - 0 | Clover l 0.98(2)(9) rel /ext. 


The method to treat the heavy quark is specified in the last column 


added complication which arises from the fact that the b-quark cannot be simulated 
directly. 

In Table 5.6 we list published results for Bg, (mp) and the ratio Bg, / Bg, from 
a variety of methods to treat the heavy quark. The table shows that all results 
are broadly consistent with each other at the level of 10%, despite the different 
systematics. Moreover, none of the listed estimates is based on non-perturbative 
renormalization factors, and furthermore all entries have been computed for a 
fixed value of the lattice spacing, i.e. a systematic study of the continuum limit 
is lacking even in the quenched approximation. As for the ratio Bp, / Bp,, it should 
be mentioned that the quark masses in the simulations correspond to pion masses 
not much smaller than 500 MeV. However, in view of the fact that the bulk of the 
relevant SU(3)-flavour breaking effect in AM; /AMg is expected to come from the 
ratio of decay constants, fp, / fp,, this may not be such a serious limitation. Results 
for Bg, and Bg, computed on dynamical gauge configurations with rooted staggered 
quarks should be published soon. 

Another recent development is the implementation of non-perturbative renormal- 
ization for heavy-light four-quark operators in the static approximation [220, 221]. 
If the b-quark is treated in the static approximation, the AB = 2 four-quark operator 
must be matched to its counterpart in the static theory, i.e. 


Q^? (mp) = Cy (ms, u) Qi (u) + Csm, i) O3 (0). (5.239) 


where 


Qi- (Vnyu( — ys)£) (vg yu (1 — ys)4) = Ovv+aa + OVAL AV 
Or = (Vn (1 — y3£) (V; ( — ys)t) = Osssp» + Osp.evs. (5.240) 


with £ denoting the light (d or s) flavour. For the physical matrix element only 
the parity-even operators OVV+LAA and Üss..pp are relevant. If chiral symmetry 
is not preserved by the discretization, four-quark operators such as ÜÓvV.LAA 
undergo complicated mixing patterns under renormalization, which necessitate 
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finite subtractions similar to those required for the operator Ovy+aa in Eq. (5.215). 
However, just as in the case of K? — K? mixing, the parity-even operators Ovv+ AA 
and Oss..pp can be mapped onto their parity-odd counterparts Ova+ Av and Osp+ps 
by a flavour rotation, which realizes the transition to tmQCD at maximal twist angle. 
Moreover, it can be shown [220] that the combinations 


O! = Ova AN. O, = Ovarav + 40spips (5.241) 


renormalize purely multiplicatively. The RG running of these operators, as well as 
the matching to hadronic schemes based on tmQCD have been determined non- 
perturbatively in the SF scheme for Ne = O [221] and Ne = 2 [222], which 
will eventually allow for a determination of Bs, and Bg with full control over 
renormalization and discretization effects. Corrections of order 1/mp can be taken 
into account through an interpolation between the results obtained in the static 
approximation and for relativistic heavy quarks with masses in the region of that 
of the charm quark. 


Semi-Leptonic B-Decays The CKM elements |V,,,| and |Vep|, which appear in 
the unitarity triangle relation equation (5.210), can be extracted from both inclusive 
and exclusive B-meson decays. However, |V,,,| is still one of the most poorly 
constrained CKM elements. Its value can be determined by combining lattice 
calculations of semi-leptonic form factors for exclusive decays such as B? — 
xte, with the experimentally measured decay rate. If the leptons are assumed 
to be massless, the latter yields the combination [| V,» | fP, while the form 
factor f} (q?) can be extracted from the matrix element 


> z > m -m 2 mg -m 2 
(nz) | (byw) (0)| B(pp)) = | (PB + Pa) — d f (q^) + qua fog ). 


(5.242) 


Here, qu = (pp — Px), denotes the momentum transfer. For a B-meson at rest one 


has 
q? = m} + mà —2mg m2 + p2. (5.243) 


In order to avoid large lattice artefacts, typical values of the pion momentum in 
simulations are restricted to 


lPz| S 1GeV. (5.244) 


Therefore, lattice calculations typically yield the form factors f, and fo near 
qg = dax By contrast, the bulk of the experimental data is recorded in bins 
with small values of q?, since the decay rate is suppressed near TNR Therefore, 


an extrapolation to small values of q? must be performed, which requires an 
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Fig. 5.21 Form factors f4 3.0 oot 
(upper data set) and fo for p A UKQCD (1999) NR 
B — mv decays (taken from | w.-0 V Abada etal. (2000) ] 
Ref. [224]). The data are | f > El-Khadra et al. (2001) J 
taken from Refs. [225] 20 F < JLOCD (2001) 
(UKQCD), [226] (Abada et [y =241 © Fermilab (2004) 
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ansatz for the shape of the form factor. Although a parameterization of the q?- 
dependence which goes beyond vector pole dominance and is also consistent with 
the expected heavy-quark scaling laws has been proposed [223], the extrapolation 
to small momentum transfers typically introduces some model dependence in the 
result for | Vip]. 

Figure 5.21 shows a compilation of lattice data for the form factors as a 
function of q? together with the curves which represent the extrapolations to 
q? = 0. The problem of the model dependence introduced by the extrapolation 
to small momentum transfer can be avoided by combining form factors from lattice 
simulations with the decay rate measured in restricted intervals of q?, which overlap 
with the range of momentum transfers that are directly accessible in simulations. 
Such a procedure has been performed by the CLEO Collaboration [231]. The result 
for |V,,5| obtained in this way is somewhat smaller compared to the standard method 
based on form factor extrapolations, but the uncertainties are still quite large. For the 
actual estimates of |V,,,| obtained in this way, the reader may consult the original 
papers. 

Semi-leptonic heavy-to-heavy decays such as B. — (D, D*)£v, offer a way 
to determine |Vep|. In this case it is convenient to use the four-velocities of the 
two mesons as the kinematical variables instead of the four-momenta. The decay 
amplitudes are then parameterized in terms of six form factors, i.e. 


(D(v’) |(éy“b)| BQ)) 
/mpgmp 
(D*(, €) |y" b)| B(v)) 
[mgmp 


(D* Q^, e) (Cy ysb)| BQ)) 


/mgmp 


= (v + v')" hi (w) + (v — v) h (w) 


= ie VB ey! ughy(w) (5.245) 


= (w+ le“ hA (o) — €*- v [v hA, (w) + vha, (@)], 
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where œw = v-v’. In the limit of infinite heavy quark mass, four out of these six form 
factors can be replaced by a single, universal form factor, £(c), which is called the 
Isgur-Wise function [232] 


mp,mo;— oo => hyo) =ha,(@) = ha, (w) = hy(o) = £(o), (5.246) 


while h_(w) and ha, (œw) vanish as mp, me become infinitely heavy. Outside the 
exact heavy-quark limit, the relation between the Isgur-Wise function and the form 
factors is modified. For instance, 


h+ (w) = (1+ f+ (0) + y. (0)) Elw), (5.247) 


where 61, y+ parameterize radiative corrections and corrections arising from 
operators of higher dimension, which are suppressed by additional inverse powers of 
the heavy quark mass. Similar relations hold for ha,, ha, and hy. Another important 
result, known as Luke's Theorem [233], states that at zero recoil, v = v’, i.e. œ = 1, 
the leading corrections to the form factors h+ and ha, are quadratic in the inverse 
heavy quark mass. 

With this setup one may devise a strategy to determine |V.5| by combining the 
experimentally determined decay rate with lattice calculations of the form factors. 
The differential decay rate for B — D*£v; in the limit of zero recoil reads 


1  dr(B > D*£v) 402 d.d 5 
l —— ————————— = |V — — * n [A 1 $ 
siyo] do EHE ga O ia AA 


(5.248) 


which, owing to Luke’s Theorem, receives corrections of order 1/ m? only. For 
œw > | the single axial form factor ha, must be replaced by a linear combination 
of several form factors. Thus, the theoretical uncertainties appear to be controlled 
best at zero recoil. Since the rate is suppressed near c = 1, the measured decay 
rate must be extrapolated to that value to determine |V¿p|. Most of the published 
lattice calculations of the form factors and the Isgur-Wise function [234—238] are 
therefore focused on the determination of the slope of £ (w) at o = 1. The measured 
decay rate can then be extrapolated to zero recoil using a particular parameterization 
of &(@), with its slope constrained via the lattice calculation. After taking radiative 
and power corrections into account, a value for |V¿p| can be extracted. 

A different but related strategy is to compute the form factors h4 (1) and ha, (1) 
directly via suitably chosen double ratios of hadronic matrix elements in which 
many systematic effects can be expected to cancel [239, 240]. Using the “Fermilab 
approach" for the heavy quarks in the quenched approximation, the authors of 
Ref. [240] find 


ha sols t (5.249) 
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where the first error is statistical, while the second represents an estimate of various 
systematic uncertainties added in quadrature. Again, this result can be combined 
with the experimental decay rate to determine |V.;|. More details can be found in 
[240]. 

Most lattice studies of heavy-to-heavy semi-leptonic B-decays have been 
restricted to the quenched approximation. However, results for the form factors 
from dynamical simulations can be expected in the near future. Clearly, in order 
to have maximum impact on the determination of |Vep|, systematic effects arising 
from lattice artefacts and the formulation used to treat the heavy quark must be 
controlled to a high degree. 


5.8 Concluding Remarks 


In this article we have introduced the lattice approach to QCD and discussed 
a variety of applications, which range from hadron spectroscopy, confinement, 
quark masses and the running coupling, to spontaneous chiral symmetry breaking 
and hadronic matrix elements for flavour physics. This illustrates not only the 
versatility of the lattice method, but also indicates that lattice calculations have 
become ever more important for making quantitative predictions in the notoriously 
difficult sector of non-perturbative QCD. Still, a great number of other applications 
have not even been covered here, including nucleon structure functions and form 
factors, calculations at finite temperature and/or chemical potential, or detailed 
investigations of the QCD vacuum structure. 

That lattice calculations have reached this standing is owed to the enormous 
progress which been made in developing more efficient algorithms for dynamical 
fermions, better discretizations, as well as a number of new theoretical concepts 
such as non-perturbative renormalization. These developments, in conjunction 
with the availability of ever more powerful computers, shall allow for precise 
computations of many phenomenologically relevant quantities, which previously 
seemed virtually intractable. 


5.9 Addendum: QCD on the Lattice 


5.9.1 Introduction 


Since the first edition of this article [241] the field of lattice QCD has undergone 
a huge transformation. While the actual methodology was well established at 
the time of writing (2007), few simulations employing dynamical quarks had 
produced results with controlled errors, having a direct impact on phenomenology 
and experiment. During the past ten years or so this has changed dramatically. 
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Simulations with light dynamical quarks, whose masses correspond to the physical 
value of the pion mass, have become the state of the art, and the effects of 
dynamical strange and charm quarks are now routinely included as well. In fact, 
lattice calculations of certain observables have reached (or are aiming for) a level 
of precision where the effects of the breaking of isospin symmetry can no longer 
be ignored. This necessitates that lattice QCD must account not only for the effects 
of unequal u and d quark masses but also for corrections due to electromagnetism, 
owing to the different electric charges of up- and down-type quarks. 

In this context it is interesting to quote a remark by Ken Wilson, made at the 
1989 International Conference on Lattice Field Theory [242]: “I still believe that an 
extraordinary increase in computing power (108 is I think not enough) and equally 
powerful algorithmic advances will be necessary before a full interaction with 
experiment takes place.” Given that, in 1989, the most powerful supercomputers 
could sustain 10 GFlops (i.e. 10!? floating point operations per second), Wilson's 
estimate was tantamount to requiring ExaFlops capabilities (10!? Flops) for lattice 
QCD to make an impact, a performance figure that has only been reached very 
recently by less than a handful of machines. The enormous progress that the field 
of lattice QCD has already seen over the past decade proves that Wilson’s view was 
far too pessimistic.?! For instance, results from lattice calculations for the decay 
constants and form factors of mesons and baryons containing heavy quarks are vital 
input for global analyses of observables in flavour physics, designed to constrain the 
elements of the Cabibbo-Kobayashi-Maskawa matrix. Furthermore, lattice QCD 
yields precise values for the masses of the light (u, d, s) quarks [244]. 

An impressive testimony to the importance of lattice QCD for the entire field 
of particle physics is the regular report provided by the Flavour Lattice Averaging 
Group (FLAG). Since its inception in 2007, FLAG has been charting the progress 
in lattice QCD, by collecting results for a range of phenomenologically relevant 
quantities. Taking inspiration from the Particle Data Group, FLAG assesses the 
quality of individual calculations and produces world averages by combining those 
results that satisfy a defined set of requirements regarding the overall control over 
systematic effects. Three editions of the FLAG report, published in 2010 [245], 
2013 [246] and 2016 [247], have appeared until now, and a fourth one has been 
published in 2019 [248]. In fact, the current status of lattice calculations of many 
observables that have been reviewed in the first edition of this article can be found 
in these comprehensive reports. 

This short review is organized as follows. In Sects. 5.9.2 and 5.9.3 we give 
an update of lattice calculations applied to hadron spectroscopy, weak hadronic 
matrix elements and the determination of Standard Model parameters such as quark 
masses and the strong coupling constant. These quantities were covered extensively 
in the original edition of [241]. Then, in Sect. 5.9.4 we extend the discussion to 
the determination of quantities that describe structural and other properties of the 
nucleon, such as form factors and the axial charge. Finally, in Sect. 5.9.5 we discuss 
lattice calculations of the hadronic contributions to the muon anomalous magnetic 


?!Éven Wilson himself acknowledged, at least partially, that this was the case [243]. 
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moment, which is a key quantity to study possible deviations from the Standard 
Model. The review concludes with a few remarks on the progress achieved over the 
past decade and an outlook for future calculations. 


5.9.2 Hadron Spectroscopy 


The calculation of the light hadron spectrum, i.e. the masses of the lowest-lying 
mesons and baryons has long been regarded a benchmark for lattice QCD. In the 
quenched approximation, i.e. in the absence of dynamical quarks, a significant 
deviation between the calculated spectrum and experiment at the level of 10-15% 
was observed. When the light hadron spectrum could eventually be accurately 
reproduced within the overall uncertainty after the inclusion of light dynamical 
quarks [249—252] (see Fig. 5.22), this was hailed as a major success of lattice QCD. 
Thanks to these milestone results, the credibility of lattice calculations was firmly 
established throughout the particle and hadron physics communities. 

Calculations of the light hadron spectrum have since been further refined, by 
taking the effects of isospin breaking into account. Strong isospin breaking arises 
from the mass splitting between the u and d quarks, m, 4 mg. Since the electric 
charges of u and d quarks differ as well, electromagntic interactions are another 
source of isospin breaking. The formulation of QED on a lattice of finite volume 
poses considerable technical challenges since the photon is massless. There are 
several strategies to address the problem of the associated zero mode, and we refer 
the reader to recent reviews of the subject [253—255], which also serve as a guide to 
the literature. 

After the inclusion of strong and electromagnetic isospin breaking effects, it 
became possible to perform another benchmark calculation, namely the accurate 
determination of the neutron-proton mass difference, as well as the mass splittings of 
other baryonic iso-multiplets [256—259]. The ability to determine isospin breaking 
effects arising from QED was also instrumental for calculations of the electromag- 
netic mass splittings of pions and kaons [260-265], which can be used to study 
violations of Dashen's theorem [266]. The latter states that the electromagnetic self- 
energies of the charged pions and kaons are identical, while those of their neutral 
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counterparts vanish. More details are found in section 3.1.1 of the FLAG report 
[247]. 

Another recent focus of lattice spectroscopy has been the determination of the 
excitation spectrum and the properties of hadronic resonances. This is a major 
refinement of previous calculations in which the masses of resonances (the simplest 
being the o-meson) were extracted naively from the exponential decay of the vector 
correlation function, thereby ignoring the fact that resonances are characterized 
both by a mass and a width. The general framework for the study of resonance 
properties in lattice QCD was developed by Lüscher already in the 1980s and 
1990s [267—270], and it is only now that the potential of this elegant and powerful 
formalism can be fully exploited. The key idea that underlies the Lüscher method 
is the realization that computing the energy levels of multi-particle states in a finite 
volume gives access to the scattering phase shifts in infinite volume, provided that 
the spectrum (including excited states) can be determined sufficiently well for a 
range of kinematical situations. The latter are typically determined by the lattice 
volume and/or the total momentum of the multi-particle system in question. 

To be more specific, let us consider the simplest resonance, the o-meson, whose 
properties can be accessed in p-wave zz scattering. For energies below the inelastic 
threshold, the Lüscher condition reads 


kL 


$(q) 4-91(k) 20 modz, q =>, 
2x 


(5.250) 


where $(q) is a known kinematic function of the scaled scattering momentum 
in units of the box size, q = kL/2z and 9; is the scattering phase shift. The 
scattering momentum k is determined from the nth energy level w, in a finite 


volume, according to 
On = m2 + K?, (5.251) 


where m; is the pion mass. Figure 5.23 shows an example of a calculation of the 
p-wave scattering phase shift as a function of the centre-of-mass energy [271]. 
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A crucial ingredient for the reliable determination of not just the energy level of 
the ground state but also the excitation spectrum is the use of correlator matrices 
computed using a suitable basis of interpolating operators (see Section 5.3 in 
Ref. [241]). The diagonalization of the correlator matrix can be achieved by solving 
a generalized eigenvalue problem from which the energy levels in a given channel 
can be determined [272-274]. The sometimes arduous task of constructing efficient 
interpolators for multi-particle states has been helped enormously by practical 
methods to compute "all-to-all" quark propagators [275] and, in particular, the 
so-called "distillation" technique [276, 277]. With these new developments it has 
been possible to perform lattice investigations of z7 scattering and the p resonance 
[278-291], as well as determinations of Kx [292, 293] and KK scattering lengths 
[294, 295]. The formalism has also been used to study meson-baryon [296—300] and 
baryon-baryon [301, 302] interactions. 

While the original Lüscher formalism was derived for the case of elastic two- 
particle scattering, it has now been generalized to coupled-channel systems [303- 
307], including the treatment of three-particle thresholds [308—315]. It also opens 
the possibility to study weak non-leptonic kaon decays [316] and compute form 
factors for timelike momentum transfers [317—320]. 

Moreover, the experimental discovery of new charmonium-like resonances, 
commonly referred to as the X, Y and Z states, has kindled a new interest in 
hadron spectroscopy. A distinctive feature of the new resonances is their closeness to 
particle thresholds, and efforts are underway to gain a detailed understanding of the 
resonance structure in the charm sector. Using the formalism described above, there 
have been many calculations of a variety of charmonium-like resonances in lattice 
QCD. In view of the vast literature, we refer the reader to several recent reviews of 
the subject [321—323]. 


5.9.3 Parameters of the Standard Model 


The Standard Model (SM) contains 19 parameters (excluding the neutrino sector) 
whose values are not predicted by the theory itself but must instead be fixed using 
experimental input. In many cases the relations between experimentally accessible 
observables and SM parameters involve quantities that encode the effects of the 
strong interactions. A well-known example is the kaon B-parameter By that 
enters the relation between the quantity ey, which is a measure of indirect CP 
violation, and a particular combination of Cabibbo-Kobayashi-Maskawa (CKM) 
matrix elements V,;4, Vis, i.e. 


ex « Bx Im (Via VË). (5.252) 
While eg can be determined experimentally from a ratio of decay amplitudes 


of long- and short-lived K-mesons, KŁ, ,s — (7) =o, the parameter Bx must 
be extracted from the hadronic matrix element of a four-quark operator between 
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K and K? states. Obviously, such a calculation must be performed in the non- 
perturbative regime of QCD since it involves typical hadronic scales. 

Other CKM matrix elements, such as V,,, Vıp and Vep are related to weak 
processes involving kaons, D- and B-mesons, which are described by a variety 
of leptonic decay constants (and their ratios), form factors of semi-leptonic meson 
and baryon decays, as well as the B-parameters that encode strong interaction 
contributions to BÓ — B? and BÜ — B? mixing. All these quantities have been studied 
in lattice QCD for many years, and increasingly precise estimates with controlled 
systematic errors have appeared over the past decade. They have been instrumental 
for recent analyses of the unitarity of the CKM matrix [324—327]. 

Similar considerations apply to SM parameters such as the strong coupling 
constant o; and the masses of the quarks. While the asymptotic scaling behaviour 
of o, gives rise to the dimensionful A-parameter that encodes the intrinsic scale 
of QCD, the quark masses are external parameters. Providing the link between 
experimentally accessible quantities and quark masses, as well as expressing the 
A-parameter in units of some measurable low-energy quantity has been a primary 
task for lattice QCD. Lattice calculations have also be instrumental for determining 
the coupling constants of effective descriptions of QCD, such as the low-energy 
constants of Chiral Perturbation Theory. 

The importance of accurate, model-independent determinations of SM parame- 
ters and input quantities for flavour physics has led to the foundation of the Flavour 
Lattive Averaging Group (FLAG). Updates of the FLAG report have appeared at 
regular intervals since the publication of its first edition in 2010 [245]. As part of its 
mission, FLAG issues global estimates and averages of lattice results, provided that 
they satisfy a set of defined quality criteria. FLAG estimates are quoted separately 
according to the sea quark content of the calculations that enter the global analyses, 
ie. whether they have been obtained with a degenerate doublet of u, d quarks 
(Ny = 2) or with an additional dynamical strange (Nf = 2 + 1) and charm quark 
(Nf = 2-4- 12-1). The current status of lattice QCD calculations of quark masses, the 
strong coupling, decay constants, form factors, mixing parameters and low-energy 
constants is summarized in Tables 1 and 2 of the 2016 FLAG report [247]. The 
FLAG webpage? contains additional updates. Below we comment on the current 
status of a few selected quantities. 


Quark Masses According to FLAG, the strange quark mass is known to 196 
precision, while the accuracy in the determination of the average u and d quark 
mass, m = in, + mq), varies between 1—5 96, depending on the sea quark 
content [328—332, 332—340]. Thanks to the recent progress in including the effects 
of isospin breaking in lattice QCD calculations, estimates for the masses of the 
individual u and d quarks could also be obtained, typically with 2 — 5 % precision 
[261, 262, 264, 330]. Furthermore, the masses of the heavy quarks have been 
determined with excellent precision [328, 330—332, 335, 337, 341—348]. 


>? http://flag.unibe.ch/. 
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Running Coupling A milestone was achieved by the ALPHA collaboration, who 
published [349] an estimate for o; (M 2) obtained by tracing the scale evolution 
of the strong coupling non-perturbatively over several orders of magnitude into an 
energy range where the application of perturbation theory can be considered safe (at 
least as far as the quoted precision is concerned). Their main result is the determina- 
tion of the A-parameter in three-flavour QCD, i.e. ARE — 341(12) MeV, which can 
be matched to the A-parameter in the five-flavour theory using perturbation theory, 
giving AD. — 215(10)(3) MeV. Finally, this is translated into the result for the 
strong coupling [349]: 


oS (M2) = 0.11852(84). (5.253) 


The quoted error is 30% smaller than that of the 2016 PDG estimate of as = 
0.1181(11) [244]. The latter includes lattice results from Refs. [331, 335, 350—354]. 


Kaon Weak Matrix Elements The kaon B-parameter By is now known with 
an overall accuracy of 1.3% [336, 355—359]. Moreover, the calculations of matrix 
elements relevant for K? — K? mixing have been extended to include operators that 
arise in extensions of the Standard Model [355, 358—363]. 

Lattice QCD results for kaon leptonic decay constants (more precisely: the 
ratio fx+/f,+) and the form factor f,(0) describing semi-leptonic K — lve 
decays have now reached a level of precision that enables a competitive and model- 
independent determination of V„s (see Sect. 5.7.1 of the original review article). 
Moreover, it is possible to test the unitarity of the first row in the CKM matrix, i.e. 
the relation 


IVa? + |Vus|* + |Vuo? = 1, (5.254) 


by combining experimental information with lattice results for f£. (0) and f/f. 
Neglecting the contribution from |Vubl? z~ 1.7- 1075, one finds that IVual? + [Vus |? 
can be determined with a total precision at the percent level, by combining the 
FLAG estimates? for f,(0) and fx+/f,+ with the experimentally accessible 
combinations |V,,| f. (0) = 0.2165(4) and |Vus/Vualfg+/fz+ = 0.2760(4) 
[244, 364]. In QCD with dynamical light, strange and charm quarks (Nf = 2+1+1) 
the result is \Vual? + |Vus |? = 0.9797(74), which signals a slight tension of 
2.7 standard deviations with the Standard Model. The precision of the unitarity test 
can be sharpened considerably by replacing |V,4| with the value extracted from 
neutron f-decay, i.e. |V,4| = 0.97417(21) [365]. It is then sufficient to provide 
one additional constraint from lattice QCD, either in the form of f+(0) or the 
ratio fx+/f_+. Inserting the lattice result for f,(0) yields |V,4]? + |V4,|?. = 
0.99884(53), which again differs from unitarity by about 20. Using instead the 


23 See the web update at http://flag.unibe.ch/. 
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lattice result for fx+/f,+ implies | Vas? + IVa? — 0.99986(46). Thus, first-row 
unitarity can be probed with permil-level precision [247]. 


Heavy-Light Decay Constants and Form Factors The treatment of heavy quarks 
on the lattice presents additional significant challenges: since the mass of the charm 
quark is close to typical values of the inverse lattice spacing, which acts as the 
ultraviolet cutoff, lattice results are prone to suffering from large discretisation 
errors. Moreover, the mass of the bottom quark exceeds currently accessible values 
of a^! , and specially designed methods are required for a consistent treatment. This 
has been discussed extensively in Sect. 5.7.2 of the original review. 

The overall precision of lattice estimates for weak hadronic matrix elements 
involving charm and bottom quarks has vastly improved over the past decade. As 
shown in Table 2 of FLAG 2016 [247], the leptonic decay constants of the B and B; 
mesons are now known at the level of 2%, while ratios such as fp,/fp have been 
determined with even better accuracy [347, 366—373]. Since the 2016 edition of the 
FLAG report, new results obtained with Nf = 2+1-+1 flavours of dynamical quarks 
[343, 374, 375] have pushed the overall precision to the sub-percent level, which is 
an impressive achievement. Also the estimates of the individual B-parameters Bp 
and B.. their ratios and combinations with the leptonic decay constants are now 
known with overall errors at the percent level [347, 370, 376, 377]. 

Results for form factors describing semi-leptonic decays of hadrons containing 
b-quarks, such as B — (D, D*)€v, or even Ap —  p£v have reached a 
level of precision that is sufficient for competitive determinations of the CKM 
matrix elements Vep and V,, from exclusive processes. An extensive discussion 
is presented in the web update of the FLAG report. 


5.9.4 Nucleon Matrix Elements 


The understanding of the internal structure of the nucleon in terms of the funda- 
mental interactions between its constituents, the quarks and gluons, has become a 
major activity within the field of lattice QCD. Structural information is encoded 
in quantities such as form factors, structure functions and (generalized) parton 
distribution functions (PDFs). An open problem in this context is the decomposition 
of the proton's spin in terms of the spins of quarks and gluons, as well as their 
angular momentum [378, 379]. Another important issue is the so-called "proton 
radius puzzle" [380], which arises due to the observed discrepancy between the 
proton radius extracted from the Lamb shift in muonic hydrogen [381, 382] 
compared to the more traditional determinations from electron-proton scattering 
[383] or the Lamb shift in electronic hydrogen [384]. Accurate knowledge of the 
electromagnetic form factors of the proton are indispensable in order to resolve—or 
corroborate—this puzzle. 

The determination of quantities such as nucleon form factors in lattice QCD pro- 
ceeds by calculating the corresponding hadronic matrix elements between nucleon 
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initial and final states. A strong motivation for computing such quantities is provided 
by the fact that fundamental interactions are often probed in scattering experiments 
involving nuclear targets. For instance, probing the neutrino sector requires accurate 
knowledge of the scattering cross sections of neutrinos with nuclear targets. Similar 
considerations apply to the search for dark matter candidates. Therefore, precise 
determinations of the corresponding nucleon matrix elements are indispensable for 
exploring the limits of the SM. 

The past decade has seen a huge rise in the number of publications describing 
lattice calculations of nucleon matrix elements. Quantities that have been studied 
include 


* the electromagnetic form factors of the nucleon, G EC(Q?) and G u (Q^, which 
give access to the electric and magnetic charge radii of the nucleon and its 
magnetic moment [385—397]; 

e the iso-vector axial charge of the nucleon, g4, which is a measure of the strength 
of weak interaction in neutron -decay [386, 387, 389, 392, 397-414], as well as 
the scalar and tensor charges, gs and gr [386, 393, 404—406, 411, 412, 414—419]; 

* axial and induced pseudoscalar form factors of the nucleon [397, 407, 409, 420, 
421], as well as the strange electromagnetic and axial form factors [421—426, 
529] which probe the quark sea inside the nucleon; 

e the pion-nucleon o-term ory [412, 427—438] and the strange content of the 
nucleon oy [412, 429-431, 435—444]. These o-terms are proportional to nucleon 
matrix element of the flavour-diagonal scalar density, gg, which parameterizes 
the rate of change in the nucleon mass due to a non-zero value of the correspond- 
ing quark mass. 


Recent reviews, presented at the annual conference on lattice field theory, can be 
found in Refs. [445—447]. Some results on nucleon form factors and other matrix 
elements are reviewed in section 3.2.5 of [448], and a dedicated chapter has been 
prepared for the 2019 edition of the FLAG report. In addition, there has been a 
community effort in the form of a white paper [449] in which lattice results are used 
to reduce the overall uncertainties in polarized and unpolarized proton PDFs and 
their moments. 

The relevant nucleon hadronic matrix elements are extracted from suitable 
three-point correlation functions of quark bilinears between interpolating operators 
representing the initial and final-state nucleons. Examples of the corresponding 
diagrams, with the initial-state nucleon placed at Euclidean time t = 0 (the source), 
the final-state nucleon at time t, (the sink) and the operator insertion at time f, 
are shown in Fig. 5.24. In addition to the quark-connected diagram, in which the 
operator is inserted on a valence quark line, there are also quark-disconnected 
diagrams in which the operator probes the quark sea. The latter class of diagrams 
must be computed to determine, for instance, iso-scalar quantities, the strangeness 
form factors and the o-terms. 

Precise determinations of nucleon matrix elements with controlled statistical and 
systematic errors are particularly challenging. This is a consequence of the fact that 
the noise-to-signal ratio in three-point correlation functions corresponding to the 
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Fig. 5.24 Quark-connected (left) and disconnected (right) diagrams representing the interaction 
of the vector current with the nucleon 


diagrams in Fig. 5.24 grows exponentially with a rate proportional to exp{(my — 
3m )t;), where my and m; denote the nucleon and pion masses, respectively, 
and f, is the source-sink separation. Techniques designed to enhance the statistical 
signal at affordable numerical cost have been developed and applied, including 
the truncated solver method [450] and “all-mode-averaging” [451]. Furthermore, a 
technique to achieve an exponential error reduction via domain decomposition and 
multi-level integration has been proposed and tested in [452, 453]. So far, it has not 
been employed in actual calculations of nucleon matrix elements with dynamical 
quarks. 

Quark-disconnected diagrams of the type shown on the right of Fig. 5.24 are 
intrinsically even noisier than their quark-connected counterparts and require special 
techniques that balance statistical accuracy against numerical cost. Commonly 
applied variance reduction techniques for quark-disconnected diagrams include 
hierarchical probing [454, 455], the coherent source sequential propagator method 
[389, 456] low-mode averaging [457, 458], the hopping parameter expansion [450, 
459-461] and partitioning/dilution [275, 462]). 

Despite these improvements, typical values of the source-sink separation ts for 
which the signal has not yet disappeared into the noise are limited to ts = 1.5 fm. 
Since the correlation function is dominated by the ground state for t, (t, — t) > 
oo, it is then not guaranteed that the matrix element of interest can be extracted 
without incurring a bias from unsuppressed excited state contributions, as long as 
one cannot probe the region t, > 1.5 fm. Hence, in addition to “standard” systematic 
effects such as lattice artefacts or finite-volume effects, one must also ensure that 
the asymptotic regime of nucleon correlation functions has been correctly isolated. 
Indeed, controlling excited state effects has become perhaps the most important 
issue in current lattice calculations of nucleon matrix elements. The commonly used 
strategies include 


* fits to three-point correlation functions or suitably defined ratios of correlators 
including sub-leading contributions from excited states [393, 394]; 

* calculations of three-point correlators summed over the operator insertion time f 
[463—467]. Contributions from excited states can be shown to be parametrically 
more strongly suppressed than in the standard case [468]; 
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* increasing the projection of nucleon interpolators onto the ground state [404, 
469], as well as the construction of an operator basis for the variational method, 
which allows for the projection onto the approximate ground state [456, 469, 
470]. 


The first two approaches proceed by fitting data obtained in a finite interval of 
source-sink separations f; to a function that describes the approach to the asymptotic 
behaviour. To be able to resolve the sub-leading contributions from excited states in 
such a fit obviously requires sufficiently precise input data. 

Another challenge for lattice calculations of nucleon matrix elements is the 
accurate description of the pion mass dependence. Although simulations at or near 
the physical pion mass are now routinely performed, the result at the physical 
point is often obtained via an extrapolation in the pion mass. The fit ansatz for 
the pion mass dependence is usually derived from chiral effective theory. However, 
the convergence properties of baryonic chiral perturbation theory are not as well 
understood as in the mesonic sector, and it is still unclear whether the predicted 
functional form provides a good description in the pion mass range over which it is 
applied. It is thus mandatory to gather sufficiently precise results at small enough 
pion mass, in order to control the systematic uncertainty associated with the chiral 
extrapolation. 

Instead of performing a detailed survey of a variety of nucleon observables, we 
single out one particular quantity—the iso-vector axial charge of the nucleon, g4, 
which is perhaps the most widely studied of nucleon matrix elements in lattice QCD 
and serves to illustrate the current state of the art. The axial charge describes the 
coupling of the W boson to the nucleon. In Minkowski space notation it is defined 
by 


(p(k, s^) | wy" ysd |n(k, s)) = ga wp(k, s") y" ys un (k, s), (5.255) 


where u,(k, s) and uy (k, s’) denote the Dirac spinors of the neutron and proton 
with four-momentum K and spins s and s', respectively. The axial charge has 
been measured experimentally in neutron £-decay, and the current world average 
quoted in the PDG is g4 = 1.2724 + 0.0023 [471]. Provided that the experimental 
sensitivity is sufficient, it may be possible to probe for scalar and tensor interactions 
that are generated by loop effects or arise due to new forces in extensions of the SM. 
The definitions of the associated scalar and tensor charges, gs and gr are derived 
from Eq. (5.255) by replacing the axial current uy” ysd by the scalar density ud and 
the tensor current wo" d, respectively. 

The calculation of g4 is facilitated by the fact that it is derived from a forward 
matrix element without any momentum transfer and, secondly, since the contri- 
butions from quark-disconnected diagrams cancel in the iso-vector combination, 
for mass-degenerate up and down quarks. Coupled with the fact that a precise 
experimental value is known, the iso-vector axial charge is a benchmark quantity 
for lattice calculations of nucleon matrix elements. Obviously, the ability of state- 
of-the-art lattice calculations to reproduce the experimental result will enhance the 
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Fig. 5.25 Compilation of recent results for the isovector axial charge. The vertical red band 
indicates the PDG average [471]. Lattice results are labelled by PNDME 18 [411], CalLat 18 
[410], PNDME 16 [406], Mainz/CLS 19 [414], PACS 18 [397], x QCD 18 [413], JLQCD 18 [412], 
LHPC 12 [392], LHPC 10 [389], Mainz/CLS 17 [409], ETMC 17 [407], ETMC 15 [405], ROCD 14 
[404], QCDSF 13 [403] and Mainz/CLS 12 [402] 


credibility of lattice predictions for the unmeasured charges gs and gr. Figure 5.25 
shows a compilation of recent results for g4, obtained in lattice QCD with Ny = 
2, 2-- 1 and 2-- 1 4- 1 flavours of dynamical quarks. While most estimates agree with 
the experimental result within errors, it is clear that the overall precision of current 
lattice calculations does not match that of the experiments. To state this observation 
more precisely, we note that the typical total error of current lattice results is at the 
level of 1-3% while experiment is an order of magnitude more precise. It should 
also be mentioned that, more often than not, lattice results tend to be slightly lower 
that the PDG average. Whether this is due to a remnant bias from excited state 
contributions or indeed to any other systematic effect, must be investigated in future 
calculations able to realize larger source-sink separations. 

The tendency to underestimate g4 in early lattice calculations of g4 has been 
attributed to unsuppressed excited state effects. In this context it is interesting to 
note that recent analyses of the contributions from Nz states to nucleon matrix 
elements based on chiral effective theory [472, 473] suggest that the asymptotic 
(physical) value of g4 is approached from above. The different conclusions drawn 
from numerical and analytic studies can only be reconciled if one succeeds in 
simulating significantly larger source-sink separations at affordable cost. 
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Given that lattice QCD calculations reproduce the experimental value of bench- 
mark quantities such as the axial charge at the level of a few percent, it is interesting 
to look at quantities that have not been measured so far. Results for the (iso-vector) 
scalar and tensor charges have been reported in [386, 393, 404—406, 411, 412, 414— 
419]. For both quantities one obtains gs, gr ^ 1, and while the typical overall 
uncertainty in gs is at the level of 10%, the tensor charge is determined with 3% 
precision, similar to that of g4. The 2019 edition of the FLAG report contains 
a detailed compilation and comparison of results for the axial, scalar and tensor 
charges, as well as flavour-singlet charges and o-terms. Calculations of these 
quantities have matured to a level which allows for global averages to be determined. 

Lattice calculations of nucleon matrix elements is a rich subject, and while a 
comprehensive discussion of other quantities such as form factors and moments of 
PDFs is beyond the scope of this short review, we refer the reader to recent reviews 
[445—447], specific sections of [448] and the white paper on PDFs [449]. 


5.9.5 Hadronic Contributions to the Muon Anomalous 
Magnetic Moment 


The SM describes with great accuracy and precision the properties of the con- 
stituents of the visible matter in the universe but leaves several profound questions 
unanswered. For instance, it cannot account for the matter-antimatter asymmetry 
and does not explain the vast hierarchy between the electroweak scale and the Planck 
mass. Most prominently, the SM cannot account for the presence of dark matter in 
the universe for which there is overwhelming observational evidence. Against this 
backdrop, the exploration of the limits of the SM and the search for “new physics” 
has become a major activity in particle physics. Traditionally, high-energy particle 
colliders have had the highest discovery potential. However, despite the fact that 
the LHC is the most powerful accelerator in the world, new particles that can, for 
instance, explain the dark matter puzzle have not been observed in the expected 
region. Therefore, additional search strategies must be pursued to detect evidence 
for physics beyond the SM. 

Observables that can be measured with very high precision and for which 
similarly accurate theoretical predictions exist at the same time, play an increasingly 
important róle for exploring the limits of the SM. One such quantity is the 
anomalous magnetic moment of the muon, a, = i( g&u — 2), where g, denotes 
the muon's gyromagnetic ratio. There has been a persistent tension of about 3.5 
standard deviations between the measured value and the SM prediction [244]: 


ay? — aM = (266 + 76) - 107. (5.256) 


As described in detail in the extensive reviews in Refs. [474] and [475], the SM 
estimate of the anomalous magnetic moment receives contributions from QED, the 
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weak and the strong interactions, i.e. 


a$M — qQED 4 awak 4 qne. (5.257) 


While QED effects account for about 99.994% of the absolute value of a. its 


total uncertainty is completely dominated by the contribution from i . Since the 
latter is mostly due to hadronic effects that are intrinsically non-perturbative, it is 
clear that special attention must be paid to their reliable evaluation. 

The most important quantum corrections to a?M arising from strong interaction 
physics are the leading hadronic vacuum polarization (HVP) and hadronic light-by- 


light scattering (HLbL) contributions. The HVP contribution, s. which arises at 
order o? (where a is the fine structure constant), can be expressed in terms of a dis- 
persion integral of the cross section ratio R(s) = o(e* e^ — hadrons)/o(e* e^ > 
iL* n), multiplied by a known kernel function. At small values of the centre-of- 
mass energy 5, the dispersion integral is evaluated using experimental data for the 
R-ratio R(s) as input [476—480]. For instance, the recent analysis of Ref. [479], 
which is based on the available data for e* e^ — hadrons, produced an estimate of 


ur = (693.1 + 3.4) - 10-9. While the total error is at the level of 0.5%, it is clear 
that experimental uncertainties enter the SM prediction for a,, in this approach. 
The HLbL contribution has been quantified mostly using hadronic models, 
although efforts are under way to formulate and apply a dispersive or data-driven 
framework to treat some of the dominant sub-processes [481—491]. The current SM 


estimate qu is based on model calculations such as the “Glasgow consensus", i.e. 


a = (105 + 26) - 10~!! [492]. Other studies, which have produced consistent 
results, can be found in Refs. [474, 478, 493]. 

Given the importance of a,, for testing the limits of the SM, it is crucial to verify 
hlbl 
H 


the current estimates of ay,’ and a? and possibly reduce their overall errors using 
an ab initio approach such as lattice QCD. Given that two new experiments (E989 at 
Fermilab and E34 at J-PARC) are set to improve the precision of the measurement of 
da, by a factor four, the importance of reliably estimating the hadronic contributions 


has become even higher. In order to make an impact, lattice QCD must be able to 


> h : : f 
constrain du with sub-percent accuracy, while an estimate of a at the level 


of 10% would already be a major step forward. Both tasks, however, present a 


considerable challenge to lattice QCD. The current status of lattice calculations of 


T and a was reviewed extensively in Ref. [494], which can be consulted for 


details. Here we present merely an overview of the main issues and a guide to the 
literature. 

The hadronic vacuum polarization contribution, a , is accessible in lattice QCD 
via different integral representations involving the correlator of the electromagnetic 
current. The first possibility is to consider a convolution integral over Euclidean 
momenta Q? of the subtracted vacuum polarization function [500, 501]. The second 
possibility is the so-called time-momentum representation defined in Ref. [502], in 
which the product of the spatially summed vector correlator G(xo) and a kernel 


P 
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function is integrated over the Euclidean time xo. A variant of the time-momentum 
representation uses the time moments of G(xo) [503]. Finally, there also exists 
a Lorentz-covariant formulation in coordinate space [504] involving the point-to- 
point vector correlator G(x, y). 

In order to meet the precision goal of sub-percent uncertainty, it is mandatory to 
have good control over the infrared regime which makes a sizeable contribution to 
gp In the formulation of Refs. [500, 501] this implies that momenta corresponding 
to Q^? < mi, must be included, since this is where the convolution integral 
receives its dominant contribution. Instead, in the time-momentum representation 
or the Lorentz-covariant formulation one must constrain the long-distance regime 
of the correlator sufficiently well. The statistical accuracy that one can attain 
for qur is affected by the well-known noise problem encountered for the vector 
correlator, i.e. the fact that the signal-to-noise ratio increases exponentially at large 
distances.** Another limiting factor for the overall precision of an” in lattice 
QCD is the knowledge of the lattice scale [499, 505]. At first sight this may seem 

ios re hvp . i ? P p 
surprising, given that a," is a dimensionless quantity. However, employing the 
time-momentum representation, one easily sees that the lattice scale enters through 
the combination (xom ui in the kernel function. Similar arguments exist for the 


other representations of ay”. Furthermore, at the level of sub-percent precision, it 
is necessary to include the contributions from quark-disconnected diagrams and the 
effects from isospin breaking (see Sect. 5.9.2). All of this is explained in great detail 
in Ref. [494]. 

First exploratory calculations of ae in full QCD were published in 2008 [506], 
and in the following years several studies appeared [497, 507—509], employing a 
range of different discretisations of the quark action, which were mostly aimed 
at investigating systematic effects. The most recent calculations are focussed on 
reducing the overall uncertainties [495, 496, 498, 499, 510—515, 530]. A comparison 
of recent estimates for aP from lattice QCD to results obtained via the dispersive 
approach is shown in Fig. 5.26. As of now, current calculations cannot match 
the accuracy of the dispersive approach, but efforts are under way to reduce the 
uncertainties to a level that makes the lattice approach competitive with data-driven 
methods [494, 516]. 

In order to determine the hadronic light-by-light scattering contribution, it is 
necessary to formulate the problem in such a way that abl! is expressed in terms 
of quantities that can be computed on the lattice with affordable effort. Several 
different strategies have been proposed and are currently being pursued: 

In a first method, the matrix element of the electromagnetic current between 
explicit muon initial and final states is computed is QCD+QED [517]. In order to 
isolate the desired light-by-light scattering contribution, one has to perform a non- 
perturbative subtraction. While the method has produced estimates in the expected 


24This is similar to, but less severe, than the noise problem encountered in nucleon correlation 
functions discussed in Sect. 5.9.4 of this review. 
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Fig. 5.26 Compilation of recent results for the hadronic vacuum polarisation contribution in 
units of 10719. The three panels represent calculations with different numbers of sea quarks. 
Lattice results are labelled by ETMC 18 [515], BMW 17 [495], HPQCD 16 [496], ETMC 13 
[497], Mainz/CLS 19 [530], RBC/UKQCD 18 [498], and Mainz/CLS 17 [499]. The phenomeno- 
logical determinations based on the R-ratio are labelled as HLMNT 11 [477], DHMZ 11 [476], 
Jegerlehner 17 [478] and KNT 18 [480]. The red vertical band denotes the estimate from dispersion 
theory quoted in KNT 18 [480] 


range, statistical errors are large, as a result of the cancellation between two large 
numbers [518]. 

In another method proposed by the RBC/UKQCD Collaboration [519, 520], 
the light-by-light scattering diagram is evaluated by inserting three explicit photon 
propagators. The positions of the insertion of these propagators are then sampled 
stochastically. In this way, results for the quark-connected and the leading quark- 
disconnected contributions have been obtained, 1.e. 


(ahe = (116.0-9.6)-107!', ahh tse = (—62.5-E8.0).107!!. — (5.258) 


The sum of the two contributions gives ahh! = (53.5 + 13.5) - 107!! which 
differs from the Glasgow consensus by a factor two. However, before jumping to 
conclusions one must take into account that systematic effects have not yet been 
fully quantified in these calculations. 

The Mainz group has proposed a method in which the QED kernel function is 
computed semi-analytically in infinite volume [521—524]. This has the advantage 
that large finite-volume effects arising from the massless photon mode are absent. 
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The method has yet to produce explicit estimates for a A variant was proposed 
by RBC/UKQCD in Ref. [525]. Another project of the Mainz group has focussed on 
the forward light-by-light scattering amplitude, which can be linked via the optical 
theorem and dispersive sum rules to models of the cross section for the process 
y'y* — hadrons [526, 527]. The results provide an important test for model 
estimates of a 

Finally, lattice QCD calculations can also be used to directly test model estimates 
of the expected dominant contribution to a from the pion pole, which requires 
knowledge of the transition form factor for 19 — y*y*. The calculation of Ref. 
[528], which was performed in two-flavour QCD, gives 


(alll)? — (65.0 + 8.3) - 107! (5.259) 
which is in very good agreement with model estimates [491]. It will be interesting 
to extend this calculation by including the corresponding contributions of the n and 
5 mesons. 

This brief survey demonstrates that lattice QCD contributes in many different 
and complementary ways to constrain the hadronic contributions to the muon g — 2 
more precisely. 


5.9.6 Concluding Remarks 


In this short review we have charted the progress of lattice QCD calculations over 
more than a decade, i.e. since the publication of the original review article. Back in 
2007, lattice QCD was on the verge of providing estimates for hadronic observables 
from first principles, which were of immediate phenomenological relevance. In the 
meantime, lattice QCD has become an indispensable tool in particle and hadron 
physics: In addition to to providing accurate estimates of SM parameters and input 
quantities for analyses in flavour physics, lattice QCD is now also making inroads 
into field such as nucleon structure and precision observables. This underlines the 
important role of lattice calculations for exploring the limits of the SM and searches 
for new physics. 

Furthermore, studying hadronic interactions, i.e. the physics of resonances and 
multi-hadron systems, has become a major activity in lattice QCD and also serves 
as a basis for the understanding of light nuclei from first principles. Other important 
applications of the lattice formulation that have not been covered in this article 
are studies of matter under extreme conditions. Indeed, many features of the 
QCD phase diagram and properties of the quark-gluon plasma that are otherwise 
inaccessible can nowadays be obtained reliably from lattice calculations. Perhaps 
the most significant development since Ken Wilson's 1989 remark, quoted in the 
introduction, is the fact that there is now a vigorous interaction between lattice QCD 
and experiment. 
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Chapter 6 A 
The Discovery of the Higgs Boson PEE 
at the LHC 


Peter Jenni and Tejinder S. Virdee 


6.1 Introduction and the Standard Model 


The standard model of particle physics (SM) is a theory that is based upon principles 
of great beauty and simplicity. The theory comprises the building blocks of visible 
matter, the fundamental fermions: quarks and leptons, and the fundamental bosons 
that mediate three of the four fundamental interactions; photons for electromag- 
netism, the W and Z bosons for the weak interaction and gluons for the strong 
interaction (Fig. 6.1). 

The SM provides a very successful description of the visible universe and has 
been verified in many experiments to a very high precision. It has an enormous range 
of applicability and validity. So far no significant deviations have been observed 
experimentally. 

The possibility of installing a proton-proton accelerator in the LEP tunnel, after 
the e*e^ programme, was being discussed in the 1980's. At the time there were 
many profound open questions in particle physics, and several are still present. In 
simple terms these are: what is the origin of mass i.e. how do fundamental particles 
acquire mass, and why do they have the masses that they have? Why is there more 
matter than anti-matter? What is dark matter? What is the path towards unification of 
all forces? Do we live in a world with more space-time dimensions than the familiar 
four? The LHC [1, 2] was conceived to address or shed light on these questions. 
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Leptons 


Fig. 6.1 Particle content of the SM, including the Higgs boson considered to be the keystone of 
the SM 


The question of how fundamental particles acquire mass was first posed in the 
following form: how does the photon remain massless, giving the electromagnetic 
force an infinite range, whilst the W and Z bosons acquire a seemingly large mass, 
explaining the short-range of the weak nuclear force. 

In 1964 three groups of physicists, Englert and Brout; Higgs; and Guralnik, 
Hagen, and Kibble [3-7], proposed that there exists an omnipresent field, pervading 
the universe, and fundamental particles can acquire mass by interacting with this 
field. At the heart of the mechanism endowing mass was spontaneous symmetry 
breaking of a local gauge symmetry, through the field's non-zero vacuum expecta- 
tion value. The new field being a quantum field had an associated quantum, which 
became known as the Higgs boson. 

Today, it seems remarkable that not much attention was paid to the papers [3-7], 
and even less to the associated Higgs boson. This was partly due to the fact that in 
the early 1960's most particle physicists were trying to make sense of a plethora of 
new particles being discovered. 

In 1967 Kibble [8] generalized his earlier work with Guralnik and Hagen and 
brought the mechanism of spontaneous symmetry breaking closer to its application 
to the description of the real world, one in which the photon remains massless and 
the W and Z particles become massive [9]. This vein of work reached fruition in 
the seminal papers of Weinberg [10] and Salam [11], which raised the prospect 
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of the unification of electromagnetism and weak interactions, now labeled the 
electro-weak theory. Earlier work on a similar model had been carried out by S. 
Glashow [12]. Weinberg and Salam assumed that W and Z bosons acquired mass by 
interacting with the field introduced in the earlier papers [3-7]. 

Both Weinberg and Salam conjectured that such a model would be renormaliz- 
able i.e. calculations would give finite answers. The key prediction of their theory 
was the existence of the Z° boson, in addition to the long-known charged W bosons. 
Again not much attention was paid to these papers. 

The situation changed dramatically in 1971. t' Hooft in a tour de force, using 
methods developed by Veltman, outlined the proof that, indeed, the electro-weak 
theory would be renormalizable [13]. The electro-weak theory started being taken 
very seriously, so much so that Weinberg’s paper [10] has now become the most 
cited paper in physics. 

Experimentally, the 1973 discovery of weak neutral currents [14], mediated by 
the Z° boson, provided strong evidence for the verity of the electro-weak theory. 

In parallel much progress had been made in understanding the particles that were 
being discovered in the 1950s and 1960s. Eventually, these were understood through 
an underlying gauge field theory, where the “charge” of strong interactions was 
labeled “colour”, and the interactions of coloured quarks are mediated by gluons. 
The theory [15, 16] displayed two main properties: colour confinement, resulting in 
the hadrons being colourless, and asymptotic freedom, leading to a steady decrease 
in the strength of the interaction between quarks and gluons as the interaction energy 
scale increases. The latter enabled the use of perturbation theory for calculating 
strong interaction processes at high energies, which has been key to understanding 
the physics at the LHC. 

Further major discoveries included those of new quarks and the gluon meant that 
the discovery, in 1983, of the W and Z bosons [17, 18] at CERN set the stage for the 
search for the Higgs boson. The Higgs boson, that earlier had been considered to be 
a minor and uninteresting feature of the spontaneous breaking mechanism, became 
to assume a role of central importance as the still missing key particle of the SM. 
The SM worked so well that the Higgs boson, or something else doing the same job, 
more or less had to be present. 

In 1984, one year after the discovery of the W and Z bosons, a workshop was held 
in Lausanne where first ideas were discussed about a possible proton-proton collider 
and associated experiments to make a search for such a particle. The aim was to 
reuse the LEP tunnel after the end of the electron-positron programme. Amongst 
the leading protagonists were the scientists from UAI and UA2 experiments. An 
exploratory machine was required to cover the wide range of mass values possible 
for the SM Higgs boson, its diverse decay signatures and production mechanisms 
and to discover any new high-mass particles at a centre-of-mass energy ten times 
higher than previously probed. A hadron (proton-proton) collider is such a machine 
as long as the proton energy is high enough and the instantaneous luminosity, Z, 
measured in cm"? s~!, is sufficiently large. The rate of production of a given particle 
is determined by “ x o where c is the cross section of the production reaction, 
measured in units of cm?. The most interesting and easily detectable final states at a 
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hadron collider involve charged leptons and photons and have a low o x BR, where 
BR is the branching ratio into the decay mode of interest. 

A major goal of the LHC thus became the elucidation of the mechanism for 
electroweak symmetry breaking. It also was clear that a search had to be made for 
new physics at the TeV energy scale as the SM is logically incomplete; it does 
not incorporate gravity. A promising avenue is the superstring theory, an attempt 
towards a unified theory with dramatic predictions of extra space dimensions and 
supersymmetry. 

The LHC and its experiments were designed to find new particles, new forces 
and new symmetries amongst which could be the Higgs boson(s), supersymmetric 
particles, Z’ bosons, or evidence of extra space dimensions. An experiment that 
could cover the detection of all these hypothesized but yet undiscovered particles 
would provide the best opportunity to discover whatever else might be produced at 
LHC energies. 

In July 2012 the ATLAS and CMS collaborations discovered a Higgs boson [19, 
20]. 

This paper is based on the previous articles [1, 21—23] written by the authors, 
some with M. Della Negra, using the recently published results from the ATLAS 
and CMS Collaborations on the measurements of the properties of the Higgs boson. 


6.2 The SM Higgs Boson 


In the early 1990's the search for the SM Higgs boson played a pivotal role in the 
design of the ATLAS and CMS experiments. The mass of the Higgs boson (mg) 
is not predicted by theory, but from general considerations, my « 1 TeV. At the 
start of the LHC operation, direct searches for the Higgs boson carried out at the 
LEP collider led to a lower bound of my > 114.4 GeV at 9596 CL [24], whilst 
precision electroweak constraints, including LEP data, implied that my « 152 GeV 
at 9596 confidence level (CL) [25]. At time of the discovery at CERN, CDF and DO 
experiments operating the Tevatron proton antiproton collider, detected an excess of 
events in the range 120-135 GeV [26]. 

It is known that quantum corrections make the mass of any fundamental scalar 
particle, such as the SM Higgs boson, float up to the next highest mass scale 
present in the theory, which in the absence of extensions to the SM, can be as 
high as 10? GeV. Hence finding the scalar Higgs boson would immediately raise a 
more puzzling question: Why should it have a mass in the range between 100 GeV 
and 1 TeV? One appealing hypothesis, much discussed at the time, and still being 
investigated, predicts a new symmetry labeled supersymmetry. For every known SM 
particle there would be a partner with spin differing by half a unit; fermions would 
have boson superpartners and vice versa, thus doubling the number of fundamental 
particles. The contributions from the boson and fermion superpartners, and vice a 
versa, with amplitudes of opposite signs, would lead to their cancellation, and allow 
a low mass for the Higgs boson. In supersymmetry five Higgs bosons are predicted 
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to exist with one resembling the SM Higgs boson with a mass below about 140 GeV. 
The lightest of this new species of superparticles could be the candidate for dark 
matter whose presence, by mass, in the universe is around five times more abundant 
than ordinary matter. 

In 1975, physicists had already started to turn their attention to how a putative 
Higgs boson would manifest itself in experiments [27]. 

The search for the SM Higgs boson provided a stringent benchmark for evaluat- 
ing the physics performance of various experiment designs under consideration in 
the early 1990s and heavily influenced the conceptual design of the general-purpose 
experiments, ATLAS and CMS. 


6.2.1 Higgs Boson: Production and Decay 


Although the mass of the Higgs boson is not predicted by theory, at a given mass all 
of its other properties are precisely predicted within the SM. The SM Higgs boson 
is short-lived (1077 s) and hence the experiments only detect the decay products. 

The cross sections for differing production mechanisms and the branching 
fractions for differing decay modes of the SM Higgs boson, as a function of 
mass, are illustrated in Fig. 6.2a, b, respectively [28], and the principal ones for 
mg = 125 GeV and at ys = 14 TeV are tabulated in Table 6.1. The uncertainties 
on these numbers can be found in the twiki in Reference [28]. 
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Fig. 6.2 (a) SM Higgs boson production cross sections as a function of the centre-of-mass energy, 
A/5. for pp. collisions. The VBF process is indicated here as qqH [28]. The theoretical uncertainties 
are indicated as bands. (b) Branching ratios for the main decays of the SM Higgs boson near 
mg — 125GeV [28]. The theoretical uncertainties are indicated as bands 
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Table 6.1 The Production cross sections and decay branching fractions at ./s = 14 TeV 


Production process Cross section (pb) Decay Mode H^ Branching fraction (96) 
gg fusion (pp —> H) 54.67 bb 58.24 
qqH (VBF) | 4.28 ww) [21.37 
WH (associated) 1.51 Tt | 627 
ZH (associated) 0.99 ZZ) 2.62 
ttH 0.61 yy 2.27 x 107! 
uu 2.18 x 107°? 


The dominant Higgs-boson production mechanism, labeled pp. — H in Fig. 6.2a 
(for masses up to ~ 700 GeV) is gluon-gluon fusion. 

The vector boson fusion (VBF) mechanism (WW? or ZZ“), labeled pp. > qqH 
in Fig. 6.2a, becomes important for the production of higher-mass Higgs bosons. 
Here, the quarks that emit the W or Z bosons have transverse momenta of the order 
of W and Z masses. The detection of the resulting high-energy jets in the forward 
pseudorapidity! regions, 2.0 < || < 5.0, can be used to tag the reaction, improving 
the signal-to-noise ratio. Tagging of forward jets from the VBF process has turned 
out to be very important in the measurements of many of the properties of the Higgs 
boson. 

The production of the Higgs boson in association with W and Z boson, labeled 
pp. > W or Z H in Fig. 62a, or the production via the t-tbar fusion, has a much 
lower cross section, but nevertheless has been important for the final states with 
large backgrounds such as b-bbar, t+ v^ or pt. 

The Higgs boson decays in one of several ways (decay modes) into known SM 
particles, the types depending on its mass. Hence a search had to be envisaged not 
only over a large range of masses but also many possible decay modes: into pairs of 
photons, Z bosons, W bosons, t leptons, and b quarks. 

In the mass interval 110 < my < 150 GeV, early detailed studies indicated that 
the two-photon decay would be the main channel likely to give a significant signal 
[29]. Detailed studies of another mode, H > ZZ) — ££, where £ stands for 
a charged electron or a muon, dubbed the "golden" mode, suggested that it could 
be used to cleanly detect the Higgs boson over a wide range of masses starting 
around mg — 130 GeV [30]. One or both of the Z bosons would be virtual for 
mg « 180 GeV, and the upper end of the detection range was indicated to be about 
mg « 600 GeV. 

In the region 700 « mg « 1000 GeV the cross-section decreases so Higgs boson 
decays via W and Z decays, where the W and Z decays are to channels with higher 
branching fractions, have to be employed. 


l The pseudorapidity 7 = —In[tan(6/2)] where and 0 is the polar angle measured from the positive 
z axis (along the beam direction). 
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6.3 The Large Hadron Collider 


6.3.1 The Road to the LHC 


With the prospect of ground-breaking physics at the LHC, several workshops 
and conferences followed, where the formidable experimental challenges started 
to appear manageable, provided that enough R&D work on detectors could be 
carried out. In 1987 the workshop in La Thuile of the so-called “Rubbia Long- 
Range Planning Committee” resulted in the recommendation of a proton-proton 
collider, labeled the Large Hadron Collider (LHC), as the next accelerator for 
CERN. Meetings of note were the ECFA LHC Workshop in Aachen in 1990 
[31], and “Towards the LHC Experimental Programme" [32] which took place 
in Evian-les-Bains, France in March 1992. At Evian several proto-collaborations 
presented their designs in “Expressions of Interest”. In addition, from the early 
1990s, CERN’s LHC Detector R&D Committee (DRDC), which reviewed and 
steered R&D groupings, greatly stimulated innovative developments in detector 
technology. 

Table 6.2 lists the major steps on the long road to the discovery of the Higgs 
boson. 


Table 6.2 The LHC Timeline 


1984 | Workshop on a Large Hadron Collider in the LEP tunnel, Lausanne, Switzerland. 


1987 | Workshop on the Physics at Future Accelerators, La Thuile, Italy. The Rubbia 
“Long-Range Planning Committee” recommends the Large Hadron Collider as the 
right choice for CERN’s future. 


1990 | LHC Workshop, Aachen, Germany (discussion of physics, technologies and detector 
design concepts). 


1992 | General Meeting on LHC Physics and Detectors, Evian-les-Bains, France (with four 
general-purpose experiment designs presented). 


1993 | Three Letters of Intent evaluated by the CERN peer review committee LHCC. ATLAS 
and CMS selected to proceed to a detailed technical proposal. 


1994 | The LHC accelerator approved for construction, initially in two stages. 

1996 | ATLAS and CMS Technical Proposals approved. 

1997 | Formal approval for ATLAS and CMS to move to construction (materials cost ceiling 
of 475 MCHF). 

1997 | Construction commences (after approval of detailed Technical Design Reports of 
detector subsystems). 

2000 | Assembly of experiments commences, LEP accelerator is closed down to make way for 
the LHC. 

2008 |LHC experiments ready for pp. collisions. LHC starts operation. An incident stops 
LHC operation. 

2009 |LHC restarts operation, pp. collisions recorded by LHC detectors. 

2010 |LHC collides protons at high energy (centre of mass energy of 7 TeV). 

2012 | LHC operates at ./s = 8 TeV: discovery of a Higgs boson. 

2015 | LHC operates at ys = 13 TeV. 
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6.3.2 The Challenges of the LHC Accelerator 


In this section we outline some of the features and the technological challenges of 
the LHC [1]. 

Protons are accelerated by high electric fields generated in superconducting r.f. 
cavities and are guided around the accelerator by powerful superconducting dipole 
magnets. The dipole magnets are designed to operate at 8.3 Tesla, allowing the 
proton beams to be accelerated to 7 TeV, with the current carrying conductor cooled 
down to 1.9 K in a bath of superfluid helium. The beam pipe in which the protons 
circulate is under a better vacuum, and at a lower temperature, than that found in 
inter-planetary space. 

The choices of two-in-one high-field superconducting dipole magnets operating 
at a temperature of 1.9 K, cooled by super-fluid helium were critical to a competitive 
and affordable design. The LHC could only be competitive with the Superconduct- 
ing Super Collider (SSC), whose construction had started in the early 1990s in 
Texas, U.S.A, if the instantaneous luminosity could be an order of magnitude higher 
(at 109^ cm~? s-!). However, the SSC was later cancelled in October 1993. 

The main challenges for the accelerator were to build more than one thousand 
two hundred 15 m long superconducting dipoles able to reach the required 
magnetic field, the large distributed cryogenic plant to cool the magnets and other 
superconducting accelerator structures, and the control of the beams, whose stored 
energy will reach, in design operation, a value of 350 MJ. This magnitude requires 
extraordinary precautions for beam handling, since if, for any reason this beam is 
lost in an uncontrolled way, it can do considerable damage to the machine elements, 
which would result in months of down time. 

The counter-rotating LHC beams are organized in 2808 bunches, each of ~10!! 
protons per bunch separated by 25 ns, leading to a bunch crossing rate of -40 MHz. 

Proton beams were first circulated in the LHC in September 2008, and in the 
days that followed, rapid progress was made in getting a beam to circulate with 
very good lifetime. Soon after the start a technical incident occurred in the last of 
the eight sectors to be tested as it was being ramped up to the pre-agreed start- 
up energy of 5 TeV. The root cause was a failure of one of the 50,000 soldered 
joints. Substantial damage was done to a large part of the sector involved. After 
repairs lasting about a year, the LHC started operating again in November 2009. 
Collisions took place at the injection energy (450 GeV per beam), followed in 2010 
and 2011, by a very successful operation at a centre-of-mass energy of 7 TeV. In 
2012 the centre-of-mass energy was increased to 8 TeV. The performance surpassed 
expectations and an integrated luminosity of ~25 fb~!, corresponding to 2 x 10!° 
proton-proton interactions, was delivered. This is labeled Run 1. 

During the period 2015-2018 the LHC operated at a proton-proton centre- 
of-mass energy of 13 TeV and delivered a total of over 150 fb-! of integrated 
luminosity. The collider performed close to, or beyond, its design values in many 
parameters, operating at 13 TeV and reaching peak luminosities of 2 x 10?4 
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Table 6.3 Some of the LHC Achieved 


N : Design 
parameters of attained/design Energy 13 TeV 14 TeV 
performance for ATLAS and 
CMS Max. no. of bunches 2556 2808 

Bunch spacing (ns) 25 25 
Protons/bunch (10!!) 1.1 1.15 
B (cm) 30 55 
Peak luminosity (1034 cm? s7!) | 2.1 1.0 
Total integrated luminosity (fb~!) 

Vs =7 TeV (Run 1) 5 

A/s = 8 TeV (Run 1) 20 

JVs = 13 TeV (Run 2) 140 


cm? s^, twice the design value. This period of operation is labeled Run 2. The 


achieved performance at the time of writing (2018) can be found in Table 6.3. 


6.4 The ATLAS and CMS Experiments 


Not only was the putative SM Higgs boson to be rarely produced in the proton 
collisions, but also it decays into particles (isolated photons, electrons, and muons) 
that are the best identifiable signatures of its production at the LHC also was 
expected to be rare. The rarity is illustrated by the fact that Higgs boson production 
and decay to one such distinguishable signature (H > ZZ“) — 4 I) happens roughly 
once in 10'? proton-proton collisions. So a vast number of proton-proton collisions 
per second have to be delivered by the accelerator and examined by the experiments. 
At the end of 2018, the LHC was operating at a collision rate of around 10? per 
second. The ATLAS and CMS detectors operate in the harsh environment created 
by this huge rate of proton-proton collisions. The challenges posed are discussed in 
reference [33, 34]. 


6.4.1 The Challenges for ATLAS and CMS Experiments 


At the Aachen workshop the physics case for the LHC was thoroughly examined. 
The experimental search for the Higgs boson across the entire possible range of 
mass was fully explored for the first time. There was a prevalent prejudice of the 
protagonists of supersymmetry that my should be smaller than 135 GeV. As the 
decay width of the SM Higgs boson is about 5.5 MeV at my = 100 GeV, and 
8.3 MeV at 150 GeV, the width of the reconstructed invariant (yy or 4 J) mass 
distribution, and hence the signal/background ratio, would be limited by the elec- 
tron/photon energy resolution of the electromagnetic calorimeter, and the charged 
particle momentum resolution of the inner tracker and the muon spectrometer. This 
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lower end of the remaining open mass range was considered to be especially difficult 
in hadron colliders. Hence the LHC experiments had to pay particular attention to 
the performance requirements imposed by the search for the Higgs boson in this 
low mass range. As a consequence much importance was placed on the tracking 
(inner and muon), as well as the magnetic field strength, and the electromagnetic 
calorimeters. 

The search for the high-mass Higgs boson, particles predicted by SUSY, and 
other exotic states mentioned above, required excellent resolution for jets and 
missing transverse momentum (pr?5*), requiring full solid angle calorimeter 
coverage. 

A saying prevalent in the late 1980's and early 1990's captured the challenge: 
*We think we know how to build a high energy, high luminosity hadron collider— 
but we don't have the technology to build a detector for it'. Making discoveries in 
the unprecedented high collision rate environment, generated by around one billion 
proton-proton interactions per second, with several tens of simultaneous collisions 
per bunch crossing, would require extraordinary detectors. Many technical, finan- 
cial, industrial and human challenges lay ahead, which were all overcome, to yield 
experiments of unprecedented complexity and power. A flavour can be attained from 
articles in reference [35]. 

At the Evian meeting in 1992 four experiment designs were presented: two 
deploying toroids (one with a superconducting magnet in the barrel) and two 
deploying superconducting high-field solenoids. The choice of the magnetic field 
configuration determined the overall design of the experiments. 

The collaborations deploying toroids merged to form the ATLAS Collaboration. 
The ATLAS design [35] was based on a very large superconducting air-core toroid 
for the measurement of muons, and supplemented by a superconducting 2 Tesla 
solenoid to provide the magnetic field for inner tracking and by a liquid-argon/lead 
electromagnetic calorimeter with a novel *accordion" geometry. The CMS design 
[36] was based on a single large-bore, long, high-field solenoid for analyzing muons, 
together with powerful microstrip-based inner tracking and an electromagnetic 
calorimeter comprising scintillating crystals. 

On top of the selected event of interest, an average of up to around 40 other 
proton-proton events are superimposed. These superposed events are referred to as 
minimum-bias events, because no selection is made. Thus thousands of particles 
emerge from the interaction region every 25 ns where one nanosecond (ns) = 107? s. 
Hence the products of an interaction under study can be confused with those from 
other interactions in the same bunch crossing. This problem, known as pileup, 
clearly becomes more severe if the response time of a detector element and its 
electronic signal is longer than 25 ns. The effect of pileup can be reduced by using 
highly granular detectors with fast, short duration, signals, giving low occupancy 
(i.e., a low probability that a detector element will give a signal) at the expense 
of having large numbers of detector channels. The resulting millions of electronic 
channels require very good time synchronization. 

The large flux of particles emanating from the interaction region creates a high- 
radiation environment requiring radiation-hard detectors and front-end electronics. 
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Access for maintenance is very difficult, time consuming, and highly restricted. 
Hence, a high degree of long-term operational reliability had to be attained, 
comparable to that which is usually associated with instruments flying on space 
missions. 

The event selection process (called the trigger) must select among the billion 
interactions that occur each second since no more than a thousand events per second 
can be stored for subsequent analysis. The short time between bunch crossings, 
25 ns, has major implications for the design of the readout and trigger systems. It 
takes a long time to make a trigger decision, yet new events occur in every crossing 
and a trigger decision must be made for every crossing; the selection process is split 
in several levels. The first of these is the Level-1 trigger decision, which takes about 
3 ws and selects, on average, one crossing out of 400. During this time the data 
must be stored in pipelines integrated into the front-end electronics. In CMS, the 
data from these selected events are then moved into a commercial farm of CPUs to 
select and store about one thousand/s of the most interesting events for subsequent 
analysis. 

It cannot be stressed enough how important were the many years of R&D and 
prototyping that preceded the start of detector construction. Technologies had to 
be developed far beyond what was the state-of-the-art in early 1990s, in terms of 
granularity, speed of readout, radiation tolerance, reliability, and very importantly 
cost. For many detector subsystems, there were initially several technologies 
considered, as it was far from certain which technologies would be able to attain the 
required performance. In many cases several variants were developed, prototyped 
and tested, before choosing the one best able to fulfill the stringent requirements. 
This involved building and testing increasingly more realistic and larger prototypes, 
in a process that involved industry from the outset. This took place over a number 
of years before construction commenced in the second half of the 1990s. 

In the 1990’s the two collaborations, ATLAS and CMS, grew rapidly in terms 
of people and institutes. Today each comprises over 3500 scientists and engineers, 
from over 150 institutions in more than 40 countries. The talents and resources of 
all these scientists were needed to build the experiments, which are now performing 
extraordinarily well at the LHC. 

The single most important aspect of the experiment design and layout is the 
magnetic field configuration for the identification of muons and the measurement 
of their momentum. Large bending power is needed to measure precisely the 
momentum of charged particles. This forces a choice of superconducting technology 
for the magnets. The design configurations chosen by ATLAS and CMS are 
discussed below. 


6.4.2 The ATLAS Detector 


The design of the ATLAS detector [35], shown in Fig. 6.3 (top), is based on 
a novel superconducting air-core toroid magnet system, containing ~80 km of 
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Superconducting Solenoid 
Silicon Tracker 


Compact Muon Solenoid 


Fig. 6.3 Schematic longitudinal cut-away views of (top) the ATLAS and (bottom) the CMS 
detectors, showing the different layers around the LHC beam axis, with the collision point in the 
centre 


superconductor cable, in eight separate barrel coils (each 25 x 5 m? in a ‘racetrack’ 
shape) and two matching endcap toroid systems. A field of ~0.5 Tesla is generated 
over a large volume. The toroids are complemented with a thin superconducting 
central solenoid (2.4 m diameter, 5.3 m length) that provides an axial magnetic field 
of 2 Tesla. 
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The electromagnetic calorimeter consists of a lead/liquid-argon sampling 
calorimeter in a novel ‘accordion’ geometry. A plastic scintillator—iron sampling 
hadron calorimeter, also with a novel geometry, is used in the barrel part of 
the experiment. Liquid-argon hadronic calorimeters are employed in the endcap 
regions near the beam axis. The electromagnetic and hadronic calorimeters have 
almost 200,000 and 20,000 cells, respectively, and are in an almost field-free region 
between the toroids and the solenoid. 

The momentum of the muons is precisely measured after traversing the calorime- 
ters in the air-core toroid field over a distance of ~5 m. About 1200 large muon 
chambers of various shapes, with a total area of 5000 m?, measure the impact 
position with an accuracy of better than 0.1 mm. Another set of about 4200 fast 
chambers is used to provide the “trigger”. 

The reconstruction of all charged particles, and that of displaced vertices, is 
achieved in the inner detector, which combines highly granular pixel (50 x 400 ym? 
elements, leading to 80 million channels) and microstrip (13 cm x 80 jum elements, 
leading to six million channels) silicon semiconductor sensors placed close to the 
beam axis, and a ‘straw tube’ gaseous detector (350,000 channels) which provides 
about 30-40 signal hits per track. The latter also helps in the identification of 
electrons using information from the effects of transition radiation. 

The air-core magnet system allows a relatively lightweight overall structure 
leading to a detector weighing 7000 tons. The muon spectrometer defines the overall 
diameter of 25 m and length of 44 m of the ATLAS detector. 


6.4.5 The CMS Detector 


The design of the CMS detector [36], shown in Fig. 6.3 (bottom), is based on a 
state-of-the-art superconducting high-field solenoid, which first reached the design 
field of 4 Tesla in 2006. 

The solenoid generates a uniform magnetic field parallel to the direction of the 
LHC beams. The field is produced by a current of 20 kA flowing through a rein- 
forced Nb-Ti superconducting coil built in four layers. Economic and transportation 
constraints limited the outer radius of the coil to 3 m and its length to 13 m. The 
field is returned through a 1.5 m thick iron yoke, which houses four muon stations 
to ensure robustness of identification and measurement and full geometric coverage. 

The CMS design was first optimized to cleanly identify, trigger and measure 
muons, e.g. arising from processes such as H — ZZ) — 4 y and few TeV mass 
Z — 2 y, over a wide range of momenta. The muons trace a spiral path in the 
magnetic field and are identified and reconstructed in -3000 m? of gas chambers 
interleaved with the iron plates in the return yoke. Another ~500 fast chambers are 
used to provide a second system of detectors for the Level-1 muon trigger. 

The next design priority was driven by the search for the decay of the SM Higgs 
boson into two photons. A new type of scintillating crystal was selected: lead- 
tungstate (PbWO,) crystal. 
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The solution to charged particle tracking was to opt for a small number of precise 
position measurements of each charged track (~13 each with a position resolution 
of ~15 jum per measurement) leading to a large number of cells distributed inside a 
cylindrical volume 5.8 m long and 2.5 m in diameter: 66 million 100 x 150 pm? 
silicon pixels and 9.3 million silicon microstrips ranging from ~10 cm x 80 um to 
-20 cm x 180 jum. The 198 m? area of active silicon of the CMS tracker is by far 
the largest silicon tracker ever built. 

Finally the hadron calorimeter, comprising ~3000 projective towers covering 
almost the full solid angle, is built from alternate plates of ~5 cm brass absorber 
and ~4 mm thick scintillator plates that sample the energy. The scintillation light 
is detected by photodetectors (hybrid photodiodes) that can operate in the strong 
magnetic field. 


6.4.4 Installation and Commissioning 


The two very different and complementary detector concepts, ATLAS and CMS, 
resulted in two different strategies for the underground installation of these experi- 
ments. 

Given its size and its magnet structure, the ATLAS detector had to be assembled 
directly in the underground cavern. The installation process began in summer 2003 
(after the completion of civil engineering work that started in 1998) and ended in 
summer 2008. Figure 6.4 (top) shows the completion of the barrel toroid magnet 
system with the insertion of the barrel calorimeters. Figure 6.4 (bottom) shows one 
end of the cylindrical barrel detector after 3.5 years of installation work, 1.5 years 
before completion. The ends of four of the barrel toroid coils are visible, illustrating 
the eightfold symmetry of the structure. 

The iron yoke of the CMS detector is divided into five barrel-wheels and three 
endcap disks at each end, giving a total weight of 12,500 tons. This structure 
enabled the detector to be assembled and tested in a large surface hall while the 
underground cavern was being prepared. The sections, weighing between 350 tons 
and 2000 tons, were then lowered sequentially between October 2006 and January 
2008, using a dedicated gantry system equipped with strand jacks: a pioneering 
use of this technology to simplify the underground assembly of large experiments. 
Figure 6.5 top shows the lowering of the heaviest and central section, supporting the 
superconducting coil. Figure 6.5 bottom shows the transverse section of the barrel 
part of CMS illustrating the successive layers of detection starting from the centre 
where the collisions occur: the inner tracker, the crystal calorimeter, the hadron 
calorimeter, the superconducting coil, and the iron yoke instrumented with the four 
muon stations. The last muon station is at a radius of 7.4 m. 

Individual detector components (e.g. chambers) of both experiments were 
built and assembled in a distributed way all around the globe in the numerous 
participating institutes and were typically first tested at their production sites, then 
after delivery to CERN, and finally again after their installation in the underground 
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Fig. 6.4 (top) Photograph of the barrel toroid magnet system after the completion of the the 
installation of the eight coils, (bottom) Photograph of one end of the ATLAS detector barrel with 
the calorimeter end-cap still retracted before its insertion into the barrel toroid magnet structure 
(February 2007 during the installation phase) 


caverns. The collaborations also invested enormous effort in testing representative 
samples of the detectors in test beams at CERN and other accelerator laboratories 
around the world. These test-beam campaigns not only verified that performance 
criteria were met over the several years of production of detector components, but 
also were used to prepare the calibration and alignment data for LHC operation. The 
so-called large combined test-beam setups, which represented whole ‘slices’ of the 
different detector layers of the final detectors, proved to be very important. 
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Fig. 6.5 (top) Photograph showing the lowering of the central barrel part and solenoid of the CMS 
detector during its installation in the cavern in 2007; (bottom) Photograph of transverse section of 
the barrel part of CMS illustrating the successive layers of detection 
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During the installation, the experiments made extensive use of the constant flow 
of cosmic rays impinging on Earth providing a reasonable flux of muons even at 
a depth of 100 m underground. Typically a few hundred per second traverse the 
detectors. These muons were used to check the whole chain from sub-detector 
hardware to analysis programs of the experiments, and to align the detector elements 
and calibrate their response prior to the proton-proton collisions. In particular, after 
the LHC incident on 19th September 2008 the experiments used the 15 months 
LHC down time, before the first collisions on 23rd November 2009, to run the 
full detectors in very extensive cosmic-ray campaigns, collecting many hundreds 
of millions of muon events. These runs allowed both ATLAS and CMS to be ready 
for physics operation, with pre-calibrated and pre-aligned detectors, by the time of 
the first pp collisions. 


6.5 Experiment Software and LHC Worldwide Computing 
Grid 


The experiment collaborations themselves develop the software that enables recon- 
struction, from raw data, of analyzable objects such as electrons, photons, jets, b 
jets, muons, and other charged tracks, and their energies or momenta. Algorithms 
have to be run to calibrate the energy deposits; align the hits from charged particles; 
and correct for changes in detector response arising from irradiation, variation 
in environmental parameters such as temperature, or changes in the position of 
detecting elements. The software packages must also simulate the response of 
the detectors to the passage of particles generated in simulated events occurring 
in bunch crossings that contain interesting physics processes, as well as simple 
backgrounds. These include processes such as the production of W or Z bosons, 
QCD jets, or Higgs bosons and their decays. Such simulations helped prepare, prior 
to the first collisions, the experiments' end-to-end processing and analysis chains, 
which were crucial for the rapid delivery of physics results of outstanding quality 
and quantity soon after the first collisions. 

The LHC computing system, termed the LHC Worldwide Computing Grid 
(WLCG) [1], was conceived to make effective use of distributed resources, work 
on a large scale, and enable all the experiments' scientists, wherever they were 
based, to have access to LHC data, and without regard to the extent of the resources 
they themselves could afford. The WLCG provided the backbone for the analysis 
capabilities of the experiments. The global WLCG has continued to grow, now 
encompassing around 170 computing centers in 42 countries, with an infrastructure 
that provides access to some 600,000 computing cores, around 500 PB of storage 
(50% on disk and 50% on tape), and a network that frequently runs at 100 Gb s7! 
between larger sites and at 10 Gb s~! between smaller sites. The security of access 
have been instrumental for building a truly federated computing infrastructure for 
science. Although the individual computing tasks described above were already 
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familiar in particle physics, the scope, scale, and geographical spread of the LHC 
computing and data analysis are unprecedented. 


6.6 Operation of the LHC: The Start of Data Taking 


On the tenth of September 2008 first beams circulated in the Large Hadron Collider. 
Nine days later, during the powering test of the last octant, alarms reached the 
LHC accelerator's control room and safety systems were activated to protect the 
accelerator. It turned out that one of the 50,000 soldered joints had malfunctioned. 
This led to an electrical arc that pierced the vacuum enclosure of a superconducting 
dipole bending magnet leading a massive escape of helium, the pressure wave 
of which caused considerable damage. The accelerator went offline for repairs. 
The ATLAS and CMS experiments continued to run round-the-clock for a few 
months recording billions of traversals of muons from cosmic rays. These data 
demonstrated that the experiments were in a good shape to take collision data. After 
a few tweaks the ATLAS and CMS experiments were even better prepared for first 
collisions, which came on 23rd November 2009. The first collision data were rapidly 
distributed, analysed and physics results produced. 

Following a preliminary low-energy run in the autumn of 2009, the ATLAS 
and CMS experiments started recording high-energy proton-proton collisions in 
March 2010 at ys = 7 TeV. Some 45 pb! of data were recorded, sufficient to 
demonstrate that the experiments were working well, according to the ambitious 
design specifications and the results they were producing were consistent with the 
predictions from known SM physics. Many parameters were examined, including 
the efficiency of identification and reconstruction of physics objects, the measured 
energy and momentum resolutions, the resolution of peaks in invariant mass 
distribution, and more. An example of the performance from the CMS experiment is 
the comparison of the observed width of Y particle with the design mass resolution. 
The width is expected to be dominated by instrumental resolution. Figure 6.6 shows 
that the observed width is measured to be 70 MeV consistent with the design value. 
Also observed in such di-muon invariant mass distributions is a history of decades of 
particle physics indicating the excellent performance of the experiments. The next 
step was to see if known physics could be measured as per the predictions of the 
SM, extrapolated to the new energies. 


6.6.1 Measurement of SM Processes to Verify Experiment 
Performance 


Observation and accurate measurement of the production of known SM particles 
at the LHC collision energies is a pre-requisite for the exploration of new physics, 
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Fig. 6.6 The distribution of the invariant mass for di-muon events, shown here from CMS, 
displays the various well-known resonant states of the SM. The inset illustrates the excellent mass 
resolution for the three states of the Y family. The mass resolutions in the central region are; 
28 MeV (0.9%) for Jy, 69 MeV (0.7%) for Y(1S), both dominated by instrumental resolution and 
T = 2.5 GeV for the Z dominated by its natural width, and are equal to the design values 


including the search for the Higgs boson. The SM processes, such as W and Z 
production, are often considered to be “standard candles’ for the experiments. 

In the ATLAS and CMS experiments, SM physics can be studied with unprece- 
dented precision, allowing comparison with the predictions of the SM with small 
instrumental systematic errors. The data collected so far have enabled many precise 
measurements of SM processes, including the production of light quarks and gluons, 
bottom and top quarks, and W and Z bosons, singly and in pairs, and with varying 
numbers of jets resulting from higher order processes. A summary of such studies 
is shown in Fig. 6.7, where measurements of cross sections for various selected 
electroweak and QCD processes are compared with predictions from the SM. These 
very diverse measurements, probing cross-sections over a range of many orders 
of magnitude, established that the experiments were "physics commissioned" and 
ready for discoveries. The detector performance was well understood and known 
SM processes were correctly observed, crucially important as they often constitute 
large backgrounds to signatures of new physics, such as those expected for the Higgs 
boson. 

The speed with which these measurements verified the SM predictions for known 
physics is a tribute to the large amount of work done by many groups, including 


282 P. Jenni and T. S. Virdee 


Standard Model Production Cross Section Measurements Status: July 2018 
T P" 1 
Q. 10! En4o nessi ATLAS Preliminary d 
ER F Theory 3 
b = o Run 1,2 y5 = 7,8,13 TeV = 
106 E Zo LHC pp V5 - 7 TeV 3 
p EN 0325 
or O n 3 
Pr>25 GNV LHC pp vs - 8 TeV 
0 F Dezo BE Daa 20.2- 20.36 3 
E zo j 
oL us LHC pp Vs = 13 TeV 1 
E E |- Daa 3.2 79.81 E 
wey 
02 E. S W, E 
9^9 go, 3 
zx. p 
E^ On 
ot b zu Dor d 
S4 cw 
D. w 
Wh 
B u^ 
le D^. o off Bg 3 
EB e 
10k Hs LE iw To J 
Um p^ 3 
Hovyy o 
10? F 5 d 
Mazza B unm ww 
ü 
5 
102 F i gt 
owe 
PP Jets Y w z tt t VV YY H WV Vy ttW tZ ttH tty Y Wij Zjj WWZy Wy yr ZyjjVVij 
EWK EWK Exch ew 


tot. tot. tot. tot, tot tot 


Fig. 6.7 A comparison of cross-section measurements for electroweak and QCD processes with 
theoretical predictions from the SM, shown here as example from the ATLAS experiment 


theorists, other collider experiments at LEP, Tevatron, HERA, b-factories and to the 
good preparation of the ATLAS and CMS experiments. 

In what follows the production of b-bar, t17, W*W_, etc. will be denoted by 
bb, tt, WW), etc. 

Using all the data so far collected extensive searches for new physics, beyond 
the standard model, have been performed. No new physics beyond the SM has yet 
been discovered. Limits have been set on e.g. quark substructure, supersymmetric 
particles (e.g. disfavouring gluino masses below 1.5 TeV in simple models of 
supersymmetry), potential new bosons (e.g disfavouring new heavy SSM W' and Z’ 
bosons with masses below 3 TeV for couplings similar to the ones for the known W 
and Z bosons) and semi-classical black holes in the context of large extra dimensions 
(with masses below 10 TeV). 


6.7 The Discovery and Properties of a Higgs Boson 


Undoubtedly, the most striking result to emerge from the ATLAS and CMS 
experiments is the discovery of the Higgs boson at a mass of «125 GeV [19] and 
[20], respectively. 
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The SM Higgs boson couples to the different pairs of particles in a set proportion 
i.e. for fermions (f) proportional to m;?, and for bosons (V) proportional my^/? 
where v is the vacuum expectation value of the scalar field (v = 246 GeV). Once 
produced the Higgs boson disintegrates immediately into known SM particles. Both 
the production modes and decay modes and rates are precisely predicted in the 
SM. A search had to be made over a large range of masses and the many possible 
decay modes with differing branching ratios as shown in Fig. 6.2 and Table 6.1. For 
example, at mg — 125 GeV the SM boson is predicted to decay into pairs of photons 
with branching fraction (BR) of 2.2 x 107, into Z bosons and then four electrons 
or muons or two muons and two electrons with BR = 1.25 x 107^, into a pair of W 
bosons and then into //vv with BR ~1%, etc. 

For a given Higgs boson mass hypothesis, the sensitivity of the search depends 
on: 


— the mass of the Higgs boson 

— the Higgs boson production cross section (Fig. 6.2a and Table 6.1), 

— the decay branching fraction into the selected final state (Fig. 6.2b and Table 6.1), 
— the signal selection efficiency, 

— the observed Higgs boson mass resolution, and 

— the level of backgrounds with the same or a similar final state. 


Comparisons with the expectations from the SM for the various combinations of 
production and decay modes are usually cast in terms of modifiers such as signal 
strength, p, that is the ratio of the measured production x decay rate of the signal 
and the SM expectation i.e. ų = o.BR/(o.BR)sm. Signal strength of one would be 
indicative of the SM Higgs boson. 

CMS and ATLAS increasingly use global event reconstruction algorithms, 
labeled particle-flow reconstruction, that attempt to identify, reconstruct and provide 
the measurement of the energy of particles by combining information from the inner 
tracker, the calorimeters and the muon system in an optimized manner. Hadronic jets 
are clustered from the reconstructed particles using the infrared- and collinear-safe 
anti-kr algorithm with a distance parameter usually set at 0.4. The jet momenta 
are measured by summing vectorially the momenta of all particles in the jet. Jets 
originating from b-jets are identified by discriminants that include the presence of 
particles originating from vertices displaced from the primary interaction vertex. 
A typical b-jet efficiency of around 70% is attained for a 1% misidentification 
probability for light quarks and gluons. The missing transverse momentum vector is 
taken as the negative of the vector sum of the momenta of all reconstructed particles 
in the event; its magnitude is labeled pr™’’. 

By the end of 2012 (LHC Run 1) the total amount of data that had been examined 
corresponded to an integrated luminosities of ~5 fb^! at ./s = 7 TeV and ~ 20 
fb! at J/s = 8 TeV, equating to the examination of some 2 x 10? proton-proton 
collisions. 

By the end of the Run 2 (2018) the total amount of data that had been recorded 
corresponded to an integrated luminosities ~150 fb^! at /s = 13 TeV, equating to 
some 1.5 x 1016 proton-proton collisions. 
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6.7.1 Event and Physics Objects Reconstruction and Analysis 
Techniques 


It is convenient to subdivide the analysis of data relating to Higgs bosons according 
to the decay channel, using datasets where the data have been selected to contain 
a particular set of final state particles. To improve the sensitivity, the events in this 
dataset are usually separated into categories that are intended to reflect the expected 
signal-to-background ratio. Several multivariate methods are used in the analyses 


— to improve event reconstruction, estimates of the energies/momenta of physics 
objects (e.g. photons and electrons, etc.), 

— to identify physics objects (such as electrons, photons, b-quarks, tau leptons, etc.) 

— to categorize events according to particular production (e.g. ggH, VBF, VH, ttH 
etc.), decay mode, or expected signal-to-background ratio. 


The reader can find the exact description of the multivariate methods used within 
the individual papers referenced in the sections below. 

Charged leptons and photons originating from the fundamental partonic pro- 
cesses tend to be “isolated” i.e. no other particles surround the one of interest. 
A relative isolation condition is applied on such particles. The sum of transverse 
momenta of accompanying particles, within an angular radius of approximately 0.3, 
around the particle of interest, is divided by the transverse momentum of the particle 
of interest. A cut on this ratio is made, the value of which is separately optimized for 
electrons, muons or photons. A correction to the accompanying energy is applied 
when the instantaneous luminosity is high and undesirable energy from pileup 
interactions is accidentally captured in the region. 


6.7.2 The Discovery: Results from the 2011 and Partial 2012 
Datasets 


In the 2011 data-taking run the ATLAS and CMS experiments recorded data at 
M/s = 7 TeV corresponding to an integrated luminosity of ~5 fb~!. In December 
2011, the first “tantalizing hints" of a new particle from both the CMS and 
ATLAS experiments were shown at CERN. The general conclusion was that both 
experiments were seeing an excess of unusual events at roughly the same place in 
mass (in the mass range 120-130 GeV) in two different decay channels. That set the 
stage for data taking in 2012. 

In January 2012 it was decided to slightly increase the energy of the protons 
from 3.5 to 4 TeV, giving a centre of mass energy of 8 TeV. By June 2012 the 
number of high-energy collisions examined had doubled and both CMS and ATLAS 
had greatly improved their analyses so it was decided to look at the area which 
had shown the excess of events but only after all the algorithms and selection 
procedures had been agreed, in case a bias was inadvertently introduced. These data 
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led to the discovery of a Higgs boson, independently in both the ATLAS and CMS 
experiments in July 2012. 

In this section we shall concentrate on the region of low mass (114.4 « mg « 150 
GeV) where the two channels particularly suited for unambiguous discovery are the 
decays to two photons and to two Z bosons, where one or both of the Z bosons could 
be virtual, subsequently decaying into four electrons, four muons or two electrons 
and two muons. These two decay modes are particularly suited for discovery as 
the observed mass resolution (~1% of my) is the best and the backgrounds are 
manageable or small. 


6.7.2.1 The H — yy Decay Mode 


In the H > yy analysis a search is made for a narrow peak in the diphoton invariant 
mass distribution in the mass range 110-150 GeV, on a large irreducible background 
from QCD production of two photons (via quark-antiquark annihilation and the 
gluon-fusion or “box” diagrams). There is also a reducible background where one or 
more of the reconstructed photon candidates originate from misidentification of jet 
fragments, with the process of QCD Compton scattering dominating. The relative 
fractions of these backgrounds in the selected events are illustrated in Fig. 6.9a. 

The event selection requires two “isolated” photon candidates satisfying pr 
and photon identification criteria. As an example, CMS applies a threshold of 
PT = myy/3 (my,y/4) to the leading (sub-leading) photon in pr, where m,y is the 
diphoton invariant mass. Scaling the pr thresholds in this way avoids distortion of 
the shape of the m,,,, distribution. The background is estimated from data, without 
the use of MC simulation, by fitting the diphoton invariant mass distribution in a 
range (100 < my, < 180 GeV). 

The results from the CMS experiments are shown in Fig. 6.8a [20]. A clear peak 
at a diphoton mass of around 125 GeV is seen. A similar result was obtained in the 
ATLAS experiment [19]. 


6.7.3.3 The H > ZZ) — 41 Decay Mode 


In the H — ZZ) — 4 | decay mode a search is made for a narrow four-charged 
lepton mass peak in the presence of a small continuum background. The background 
sources include an irreducible four-lepton contribution from direct ZZ“) production 
via quark-antiquark and gluon-gluon processes. Reducible background contribu- 
tions arise from Z + bb and tt production where the final states contain two isolated 
leptons and two b-quark jets producing secondary leptons. 

The event selection requires two pairs of same-flavour, oppositely charged 
isolated leptons. Since there are differences in the reducible background rates and 
mass resolutions between the sub-channels 4e, 4 u, and 2e2j, they are analysed 
separately. Electrons are typically required to have pr » 7 GeV. The corresponding 
requirements for muons are pr > 5 GeV. Both electrons and muons are required to 
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Fig. 6.8 (a) The two-photon invariant mass distribution of selected candidates in the CMS 
experiment, weighted by S/B of the category in which it falls. The lines represent the fitted 
background and the expected signal contribution (my = 125 GeV). (b) The four-lepton invariant 
mass distribution in the ATLAS experiment for selected candidates relative to the background 
expectation. The expected signal contribution (my = 125 GeV) is also shown 


be isolated. The pair with invariant mass closest to the Z boson mass is required to 
have a mass in the range 40-120 GeV and the other pair is required to have a mass 
in the range 12-120 GeV. The ZZ background, which is dominant, is evaluated 
from Monte Carlo simulation studies. 

The m4 distribution is shown in Fig. 6.8b for the ATLAS experiment [19]. A 
clear peak is observed at ~125 GeV in addition to the one at the Z mass. The latter 
is due to the conversion of an inner bremstrahlung photon emitted simultaneously 
with the dilepton pair. A similar result was obtained by the CMS experiment [20]. 


6.7.2.3 Combinations 


A search was also made in other decay modes of a possible Higgs boson and 
combined to yield the final results published in August 2012 by ATLAS [19] and 
CMS [20] experiments. Both ATLAS and CMS independently discovered a new 
heavy boson at approximately the same mass, clearly evident in the two different 
decay modes, yy and ZZ“), The observed (expected) local significances were 6.00 
(5.00) and 5.00 (5.80) in ATLAS and CMS respectively, indicating that a new 
particle had been discovered. 

The decay into two bosons (two photons; two Z bosons or two W bosons) implied 
that the new particle is a boson with spin different from one, and its decay into two 
photons that it carries either spin-O or spin-2. 

The results presented by both ATLAS and CMS collaborations were consistent, 
within uncertainties, with the expectations for a SM Higgs boson. Both noted that 
collection of more data would enable a more rigorous test of this conclusion and an 
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investigation of whether the properties of the new particle imply physics beyond the 
SM. 


6.7.3 Results from the Data Recorded Subsequent 
to the Discovery 


The combined results from the ATLAS and CMS experiment from Run 1 on the 
Higgs boson production, decay rates and constraints on its couplings were published 
in 2016 [37]. These results have been superseded by the ones presented below. 
Results are presented from the most recently published papers (in journals or 
submitted to the hep arXiv) from the two collaborations. The integrated luminosity 
differs from one result to another and is indicated in the legends of the plots 
presented. 

The LHC centre of mass energy was increased from ys = 8 TeV to ,/s = 13 TeV 
in 2015. At the higher value of ./s the predicted cross-sections for the dominant 
ggH production mode and the rare ttH production mode increased by factors of 
~2.3 and ~ 3.8, respectively. This and the larger datasets from Run 2 allow a more 
precise comparison of the properties of the Higgs boson with respect to those 
predicted by the SM. In addition, since the discovery, the theoretical predictions 
have become more accurate with the inclusion of further (higher) order corrections. 
Details can be found below in the references included in the individual papers of the 
two collaborations. 

The two collaborations have also improved the reconstruction of physics objects 
and the methods of analysis. Event categorization and machine learning methods are 
deployed to study almost all the different production and decay modes. The analyses 
described below divide events into multiple categories reflecting the different 
Higgs boson production channels to improve the sensitivity of the measurements. 
Associated production processes (WH and ZH), or the ttH production process, are 
tagged by requiring the presence of additional leptons or jets. The VBF process is 
tagged using distinctive kinematic properties such the presence of two jets with a 
large separation in pseudorapidity and a large invariant jet-jet mass. In some cases 
the kinematic characteristics of the whole event, such as large missing pr, are used 
to preferentially select events e.g. arising from ZH production where the Z boson 
decays to neutrinos. 


6.7.3.1 The H — yy 


As the H — yy decay proceeds via W-boson and top-quark loops, it is especially 
sensitive to the presence of any undiscovered heavy charged fermions and bosons. 
Any significant deviation from the precise SM prediction for the cross section would 
be indicative of new physics. 
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The H — yy mode provides good sensitivity to almost all Higgs boson 
production processes. The interference between W-loop and top-loops provides 
sensitivity to the relative sign of the fermion and boson couplings. 

It is common to use a dedicated boosted decision tree discriminator to select 
and categorize events; it is constrained using the diphoton kinematic variables, 
photon isolation and identification variables, and per-event estimated diphoton mass 
resolution for the pair of photons in the event. 

ATLAS has measured the properties of the H — y y mode using 79.8 fb! of 
collision data recorded at ./s = 13 TeV [38]. The properties measured include 
the signal strength, the cross section measurements for the production of a Higgs 
boson through gluon-gluon fusion, vector boson fusion, and in association with 
a vector boson or a top-quark pair. They are found to be compatible with the 
predictions of the SM. The signal strength is measured to be ju = 1.06 + 0.08 
(stat), *908 557 (exp), *0.07 o 96 (theo), improving on the precision of the previous 
ATLAS measurement at ,/s = 7 and 8 TeV by over a factor of three. The cross 
section for the production of the Higgs boson decaying to two isolated photons in the 
fiducial region of the selection of photons is measured to be 60.4 + 6.1 (stat) + 6.0 
(exp) + 60.3 (theo) fb, in good agreement with the SM value of 63.5 + 3.3 fb. The 
differential cross section, sensitive to higher order QCD corrections and properties 
of the Higgs boson, such as its spin and CP quantum numbers, is illustrated in Fig. 
6.9b and no significant deviation from a wide array of SM predictions is observed. 

CMS has reported results from the H — yy decay channel based on data 
collected at ./s = 13 TeV corresponding to an integrated luminosity of 35.6 fb! 
[39]. The diphoton invariant mass distribution, observed in CMS, is shown in 
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Fig. 6.9 (a) The data-driven determination of event fractions for yy, y-jet, and jet-jet events as a 
function of m,y after the final selection. (b) The fiducial differential cross section for pp. > yy as 
a function of pr(yy) compared to the SM expectations 
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Fig. 6.10 (a) Data and signal-plus-background model fits for all categories summed, weighted 
by their sensitivity. The one (green) and two (yellow) standard deviation bands include the 
uncertainties in the background component of the fit. The lower panel shows the residuals after 
the background subtraction. (b) Cross section ratios measured for each process (black points) 
compared to the SM expectations and their uncertainties (blue band). The signal strength modifiers 
are constrained to be nonnegative, as indicated by the vertical line and hashed pattern at zero 


Fig. 6.10a. The measured signal strength is found to be 1.18 +°!7_ 44, largely 
insensitive to the precise mass value assigned to the Higgs boson. Signal strengths 
associated with the different Higgs boson production mechanisms are shown in Fig. 
6.10b and found to be compatible with the expectations from the SM. 


6.7.3. H — ZZ) — 41 Decay Mode 


The Higgs boson decay H — ZZ? — 4 lis the most significant process in 
constraining the HZZ coupling. To study the differing production mechanisms 
involved, the events are categorized on the basis of the presence of jets, b-tagged 
jets, leptons, pr?55, and various matrix element discriminants that make use of 
the information about the additional objects: VBF (1- and 2-jet), VH hadronic, VH 
leptonic, ttH, VH pr™*s, and untagged categories. 

ATLAS has studied the coupling properties of the Higgs boson in the four- 
lepton (eL) decay channel using 36.1 fb-! of pp. collision data recorded at 
A/s = 13TeV [40]. The four-lepton invariant mass distribution is illustrated in Fig. 
6.1 1a. Cross sections are measured for the main production modes and the ratio of 
sigma.BR/(sigma.BR)sm are plotted in Fig. 6.11b. The inclusive cross section times 
branching fraction for H — ZZ) decay and for a Higgs boson absolute rapidity 
below 2.5 is measured to be 1.733924 954 pb. the statistical error dominating, 
compared to the SM prediction of 1.34 + 0.09 pb. 
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Fig. 6.11 (a) Observed and expected four-lepton invariant mass distribution for the selected Higgs 


boson candidates with a constrained Z boson mass. (b) Observed and expected SM values of the 
cross-section ratios o.B normalized by the SM expectation (o.B)sm for the inclusive production 


CMS Preliminary 2016 » 2017 A fb E : 
e BOO Re OT LEER UTR) = 4 CMS Preliminary 41515 (13 TeV) 
8 [- * Data i > t 2N 2 
= 200 , @H(125) = u E PX HoOZZo4 i 
` E 4 CUqqozzzr  - P 95r i \ my = 125.09 GeV + 
a 180r Bgg- Zz  —- E I \ i 
S 160E BUz.x E sr l Y —68% C.L. 4 
T E J F | \ -——9595 CL 1 
a z : \ ] 
mp E 25R | \ + bestfit -4 
120F d 2 E | \ * SM i 
100E- | E f | \ 1 
80 E 4 1.5 E \ y E 
E =| E \ 1 
60 [- -J 1E \ 
40 - i E \ 
rF f | 0.5 F 
20r. "A - F 
Le i” 1 of 
80 100 200 300 400500 700 900 0 0.5 1 15 2 25 3 
H 
ma, (GeV) goH,bbH,tiH.taH 
(a) (b) 


Fig. 6.12 (a) Distribution of the four-lepton reconstructed invariant mass mą; in the full mass 
range, combining data from 2016 an 2017. Points with error bars represent the data and stacked 
histograms represent expected distributions of the signal and background processes. The ZZ“? 
backgrounds are normalized to the SM expectation, the Z + X background to the estimation from 
data. (b) Result of the 2D likelihood scan for the u(ggH, ttH, bbH, tqH) and (VBF, VH) signal- 
strength modifiers 


CMS has studied the coupling properties using 77.4 fo~! of pp. collision data 
recorded at ./s = 13TeV [41]. The four-lepton invariant mass distribution is 
illustrated in Fig. 6.12a. The signal strength is measured to be u = 1.06*9? o 13 
at my = 125.09 GeV, the combined ATLAS and CMS measurement of the Higgs 
boson mass [42]. The result of a 2D likelihood scan of the signal strengths for the 
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individual Higgs boson production modes are also measured and are shown in Fig. 
6.12b. All measurements are consistent with the expectations from the SM. 


6.7.3.3 H — WW) — 212v Decay Mode 


The H — WW? decay mode has a large branching fraction (~20%) and a relatively 
low-background final state. The study of this final state in which both W bosons 
decay leptonically is based on the signature with two isolated, oppositely charged, 
high pr leptons (electrons or muons) and large missing transverse momentum, 
Er”, due to the undetected neutrinos. The signal sensitivity is improved by 
separating events according to lepton flavor; into ete~, + 1~, and ew samples and 
according to jet multiplicity into 0-jet and 1-jet samples. The dominant background 
arises from irreducible non-resonant WW? production, and the dominant uncer- 
tainties arise from the estimation, using the data themselves, of the backgrounds 
from top quark pair, WW“) and DY production. 

The final states are categorized according to the number of associated jets, with 
the 0-jet category dominating the overall sensitivity. Events are selected that contain 
two leptons of either different or the same flavour. The large background from tt 
production, the different and same flavour final states are further categorized with 0, 
] and 2 associated jets. In the different-flavour final state, dedicated 2-jet categories 
are included to enhance the sensitivity to VBF and VH production mechanisms. 

ATLAS has presented measurements of the inclusive cross section of Higgs 
boson production via the gluon-gluon fusion (ggF) and vector-boson fusion (VBF) 
modes [43], based on an integrated luminosity of 36.1 fb! recorded at ~s = 13 TeV 
in 2015-2016. The combined transverse mass distribution for Njet < 1 is shown in 
Fig. 6.13a. The ggF and VBF cross-sections times the H — WW? branching ratio 
are measured to be 12.6 + 1.0(stat) *'? ; s (syst) pb and 0.50 + 0.24 (stat) +0.18 
(syst) pb, respectively, in agreement with the SM predictions, as illustrated in Fig. 
6.13b. 

CMS has published results on the decay mode H — WW? using data cor- 
responding to an integrated luminosity of 35.9 fb-!, collected at Js = 13 TeV 
during 2016 [44]. The expected relative fraction of different Higgs boson production 
mechanisms in each category is shown in Fig. 6.14a, together with the expected 
signal yield. Combining all channels, the observed cross section times branching 
fraction is 1.28+°!8 9 17 times the SM prediction for the Higgs boson with a mass 
of 125.09 GeV. The ratio of the observed and the predicted cross sections for the 
main Higgs boson production modes is shown in Fig. 6.14b. All are consistent with 
the predictions from the SM. 


6.7.3.4 The H — tt Decay Mode 


All of the decay modes discussed so far test the direct coupling of the Higgs boson to 
bosons, and only indirectly probe, through quantum loops, its coupling to fermions. 
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Fig. 6.13 (a) Post-fit combined transverse mass distribution for Njet < 1. The bottom panel shows 
the difference between the data and the estimated background compared to the distribution for a 
SM Higgs boson with my = 125GeV. The Hypr contribution is too small to be visible. (b) 68% 
and 95% confidence level two-dimensional likelihood contours of oggr-By > WW) vs. ovpr: 
By — WW), compared to the SM prediction shown by the red marker 
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Fig. 6.14 (a) Expected relative fraction of different Higgs boson production mechanisms in each 
category included in the combination, together with the expected signal yield. (b) Observed cross 
sections for the main Higgs boson production modes, normalized to the SM predictions. The 
vertical line and band correspond to the SM prediction and associated theoretical uncertainty 


The H — rr mode provides the best sensitivity for the direct measurement for Higgs 
boson coupling to fermions. It benefits from a relatively large branching fraction, a 
moderate mass resolution (~ 10-20%) and provides good sensitivity to both the ggH 
and VBF production processes. 
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The H — tt mode is studied via tau decays to eu, Mu, eth, th, ThTh, 
where electrons and muons arise from leptonic t-decays and rj denotes a c 
lepton decaying hadronically. Each of these categories is further divided into three 
sub-categories labeled 0-jet, boosted and VBF. The 0-jet category helps constrain 
background normalisation, identification efficiencies, and energy scales, and sys- 
tematic uncertainties in the background model. The main irreducible background, 
Z — tt production, and the largest reducible backgrounds (W + jets, QCD multijet 
production, and top quark pair) are evaluated from control samples in data. 

CMS has observed the H — tt mode using a data sample corresponding to an 
integrated luminosity of 35.9fb~! at ./s = 13 TeV [45]. Figure 6.15a shows the 
distribution of the decimal logarithm of the ratio of the expected signal and the sum 
of expected signal and expected background in each bin of the mass distributions 
used to extract the results, in all signal regions. The background contributions 
are separated by decay channel. The inset shows the corresponding difference 
between the observed data and expected background distributions divided by the 
background expectation, as well as the signal expectation divided by the background 
expectation. The best fit of the product of the observed H — tt signal production 
cross section and branching fraction is 1.09*027 o 56 times the SM expectation. The 
combination with the corresponding measurement performed with data collected by 
the CMS experiment at center-of-mass energies of 7 and 8 TeV leads to an observed 
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Fig. 6.15 (a) Distribution of the decimal logarithm of the ratio between the expected signal and 
the sum of expected signal and expected. The inset shows the corresponding difference between 
the observed data and expected background distributions divided by the background expectation, 
as well as the signal expectation divided by the background expectation. (b) Best fit signal strength 
per category 
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significance of 5.9 standard deviations, equal to the expected significance. Figure 
6.15b right plots the signal strength per category for my = 125.09 GeV. 

ATLAS has observed the H — tt mode using 36.1 fb~! of data recorded at 
A/s = 13 TeV [46]. All combinations of leptonic and hadronic tau decays were 
considered. Combining all data taken at \/s = 7, 8 and 13 TeV, the observed 
(expected) significance is found to be 6.4 (5.4) standard deviations. Using the data 
taken at ,/s = 13 TeV, the total cross section, in the H — cr decay channel, is 
measured to be 3.71 + 0.59 (stat) +0.87 4 44 (syst) pb, for my = 125 GeV, assuming 
the relative contributions of its production modes predicted by the SM. Total cross 
sections are determined separately for vector boson fusion production and gluon— 
gluon fusion production to be o(VBF, H — rr) = 0.28 + 0:09 (stat) +0.11 pog 
(syst) pb and o(ggF, H — tt) = 3.0 + 1.0 (stat.) ^9 ,» (syst) pb, respectively. 
The measured values for o(H — tT), when only the data of individual channels 
are used, are shown in Fig. 6.16a, along with the result from the combined fit. 
The theory uncertainty in the predicted signal cross section is shown by the yellow 
band. Figure 6.16b shows the likelihood contours in the variables (ggE, H — tr) 
and (VBF, H — rr) for the combination of all channels. The 68% and 95% CL 
contours are shown as dashed and solid lines, respectively, for mg — 125 GeV. The 
SM expectation is indicated by a plus symbol and the best fit to the data is shown as 
a star. All measurements are in agreement with SM expectations. 


6.7.3.5 H— bb Decay Mode 


In the SM, fermions couple directly to the Higgs boson via the Yukawa interaction. 
A clear test of this hypothesis would be the measurement of the H— bb coupling. 
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Fig. 6.16 (a) Measured values for o(H — tT) when only the data of individual channels are used. 
Also shown is the result from the combined fit. The total + lo uncertainty in the measurement 
is indicated in black, with the individual contribution from the statistical uncertainty in blue. (b) 
Likelihood contours for the combination of all channels in the [(ggF, H > tT) v/s (VBF, H > tr)] 
plane. The SM expectation is indicated by a plus symbol and the best fit to the data is shown as a 
star 
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The H — bb decay mode has by far the largest branching ratio (~58%). However, 
this is the most difficult decay channel to observe, since bottom quark pairs are 
prolifically produced by QCD processes and give rise to a formidable background. 
The cross section for b-quark pair production, opp(QCD), is ~10’ x o(H — bb). 
Therefore the search concentrates on Higgs boson production in association with a 
W or Z boson using the following decay modes: W — ev/uv and Z — ee or uu or 
vv. The Z — vv decay is identified by the requirement of a large missing transverse 
energy. The Higgs boson candidate is reconstructed by requiring two b-tagged jets. 

Events are selected in 0-, 1- and 2-charged lepton (e or jt) channels, to explore the 
ZH — vvbb, WH — lvbb, ZH — libb signatures, respectively. Both experiments 
introduced several improvements since the initial searches including more efficient 
identification of b-jets, better dijet mass resolution and use of multivariate dis- 
criminants that better separate signal from background. Multivariate discriminants, 
built from variables that describe the kinematics of the selected events, are used to 
maximise the sensitivity to the Higgs boson signal. The signal extraction method 
is validated with, for example, the diboson analysis where the nominal multivariate 
analysis is modified to extract the VZ, Z — bb diboson process. 

ATLAS has observed the mode H — bb by analyzing the combined data from 
Run 1 and Run 2 [47], corresponding to an integrated luminosity of 80fb~! yielding 
an observed (expected) significance of 5.4 (5.5) standard deviations, thus providing 
direct observation of the Higgs boson decay into b-quarks. The signal strength is 
measured to be 1.01 4 0.12(stat) +0.16 9 is(syst). Figure 6.17a shows the distribution 
of mpp in data after subtraction of all backgrounds except for the WZ and ZZ? 
diboson processes using data taken at ./s = 13 TeV. The contributions from 
all lepton channels, p T regions, and number-of-jets categories are summed and 
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Fig. 6.17 (a) Distribution of mpp in data after subtraction of all backgrounds except for the WZ 
and ZZ diboson processes, as obtained with the dijet-mass analysis. (b) Fitted values of the Higgs 
boson signal strength, u, for my = 125 GeV for the WH and ZH processes and their combination, 
using the 7 TeV, 8 TeV and 13 TeV data 
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Fig. 6.18 (a) Dijet invariant mass distribution for events weighted by S/(S + B) in all channels 
combined in the 2016 and 2017 data sets. The error bar for each bin represents the presubtraction 10 
statistical uncertainty on the data, while the gray hatching indicates the 1o total uncertainty on the 
signal and all background components. (b) Best-fit value of the H — bb signal strength with its 1o 
systematic (red) and total (blue) uncertainties for the five individual production modes considered, 
as well as the overall combined result. The vertical dashed line indicates the SM expectation 


weighted by their respective S/B, with S being the total fitted signal and B the total 
fitted background in each region. The expected contribution of the associated WH 
and ZH production of a SM Higgs boson with my = 125 GeV is shown, scaled 
by the measured signal strength (u —1.06). The size of the combined statistical 
and systematic uncertainty for the fitted background is indicated by the hatched 
band. Figure 6.17b shows the fitted values of the Higgs boson signal strength, where 
M (VHbb) = 0.98 *02? 55 for my = 125 GeV for the WH and ZH processes and 
their combination, using the 7 TeV, 8 TeV and 13 TeV data. 

CMS has observed the mode H — bb. Figure 6.18a shows the weighted dijet 
invariant mass distribution for events weighted by S/(S + B) in all channels 
combined in the 2016 and 2017 data sets [48]. The data (points), the fitted VH 
signal (red) and VZ background (grey) distributions, with all other fitted background 
processes subtracted, except that from dibosons are shown in Fig. 6.18a. Figure 
6.18b shows the best-fit value of the H — bb signal strength for the five individual 
production modes considered, as well as the overall combined result. The vertical 
dashed line indicates the SM expectation. All results are extracted from a single 
fit with my = 125.09 GeV. CMS has made measurements, using data collected at 
A/s = 7, 8, and 13 TeV, and observes an excess of events at my = 125 GeV with 
a significance of 5.6 standard deviations, where the expectation for the SM Higgs 
boson is 5.5, and a signal strength of 1.04 + 0.14 (stat.) +0.14 (syst.). 
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6.7.3.6 H — wp Decay Mode 


The H — +u- decay mode extends the test of the Higgs boson’s coupling to the 
second generation of fermions. Several scenarios beyond the SM predict a higher 
branching fraction than the one predicted in the SM (2.2 x 107^ at my = 125 GeV). 

The dominant and irreducible background arises from the Z/y, — pp process 
that has a rate several orders of magnitude larger than that from the SM Higgs boson 
signal. However, due to the precise muon momentum measurement achieved by 
ATLAS and CMS, the dimuon mass resolution is excellent (~ 2—3%). A search 
is performed for a narrow peak over a large but smoothly falling background. 
For optimal search sensitivity, events are divided into several categories. Taking 
advantage of the superior muon momentum measurement in the central region 
events can be subdivided by the pseudorapidity of the muons, or by selections 
aiming at specific production processes. A category selecting the vector boson 
fusion process with its distinctive signature and relatively large cross section is 
particularly useful. 

ATLAS has performed this search using data corresponding to an integrated 
luminosity of 36.1 fb^! collected at /s = 13 TeV [49]. No significant excess is 
observed above the expected background. When combined with the data taken at 
a/s = 7 and 8 TeV, the observed (expected) cross-section upper limit is 2.8 (2.9) 
times the SM prediction. 

The search in CMS, using an integrated luminosity corresponding to 35.9 fb! 
recorded at ./s = 13 TeV [50], and combining with data taken at J/s = 7 and 8 
TeV, yielded an observed (expected) cross-section upper limit is 2.92 (2.16) times 
the Standard Model prediction. 


6.7.3.7 ttbar H Production Mode 


As m; » mg the Yukawa coupling of the Higgs boson to top quarks cannot be tested 
directly. However, it can be measured through the measurement in the pp. — ttH 
production process. The coupling of the Higgs boson to the top quark, the heaviest 
particle in the SM, could be very sensitive to the effects of physics beyond the SM. 

Although the pp. — ttH production process only contributes around 1% of the 
total Higgs-boson production cross section, the top quarks in the final state offer 
a distinctive signature and allow many Higgs-boson decay modes to be accessed. 
Of these, the decay to two b-quarks, the Higgs boson decay mode with the largest 
branching fraction, also is sensitive to the b-quark's Yukawa coupling, the second 
largest in the SM. 

A top quark decays almost exclusively to a bottom quark and a W boson, with 
the W boson subsequently decaying either to a quark and an antiquark or to a 
charged lepton and its associated neutrino. The Higgs boson has a rich spectrum 
of decay modes, and ttH production is studied using a wide variety of final state 
event topologies, with the Higgs boson decaying into bb, WW), tt, y y, and ZZ? 
pairs. 
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Fig. 6.19 (a) Observed event yields in all analysis categories in up to 79.8fb^! of 13 TeV data. 
The lower panel shows the ratio of the data to the background estimated from the fit with freely 
floating signal, compared to the expected distribution including the signal assuming u = 1.32 
(full red) and u = 1 (dashed orange). The error bars on the data are statistical. (b) Combined ttH 
production cross section, as well as cross sections measured in the individual analyses, divided by 
the SM prediction 


ATLAS has observed this production mode using data taken at ./s = 7 TeV, 
8 TeV and 13 TeV corresponding to integrated luminosities up to 79.8 fb^!. The 
Higgs boson decays included comprise bb, WW*, t + t—, yy, and ZZ*. The 
observed significance is 6.30, compared to an expectation of 5.1o [51]. Assuming 
SM branching fractions, the total (tH production cross section at ,/s = 13 TeV 
is measured to be 670 + 90(stat.) +110 oo(syst.) fb, in agreement with the SM 
prediction. Figure 6.19a shows the observed event yields in all analysis categories. 
The background yields correspond to the observed fit results, and the signal yields 
are shown for both the observed results (u = 1.32) and the SM prediction (u = 1). 
The ranking of the discriminant bins is carried out by logio(S/B), where S is the 
extracted signal yield and B the extracted background yield. Figure 6.19b shows 
the combined ttH production cross section, as well as cross sections measured in 
the individual analyses, divided by the SM prediction. The black lines show the 
total uncertainties, and the bands indicate the statistical and systematic uncertainties. 
The red vertical line indicates the SM cross-section prediction, and the grey band 
represents the PDF and os uncertainties and the uncertainties due to missing higher- 
order corrections. 

CMS has observed ttH production in a combined analysis of data at J/s = 7, 
8, and 13 TeV, corresponding to integrated luminosities of up to 5.1, 19.7, and 
35.9 fb—!, respectively [52]. An excess of events is observed with an observed 
(expected) significance of 5.2 (4.2) standard deviations, over the expectation from 
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Fig. 6.20 (a) Distribution of events as a function of the decimal logarithm of S/B. The shaded 
histogram shows the expected background distribution. The two hatched histograms, each stacked 
on top of the background histogram, show the signal expectation for the SM (pun = 1) and the 
observed (mag = 1.26) signal strengths. The lower panel shows the ratios of the expected signal 
and observed results relative to the expected background. (b) Best fit value of the ttH signal strength 
modifier utn, with its lo and 2o confidence intervals for upper section) the five individual decay 
channels considered, middle section) the combined result for 7 + 8 TeV alone and for 13 TeV 
alone, and lower section) the overall combined result. The SM expectation is shown as a dashed 
vertical line 


the background-only hypothesis for my = 125.09. The combined best-fit signal 
strength normalized to the standard model prediction is 1.26*031 4 56. Figure 6.20a 
shows the distribution of events as a function of the decimal logarithm of S/B, where 
S and B are the expected postfit signal (with pg = 1) and background yields, 
respectively, in each bin of the distributions considered in this combination. The 
shaded histogram shows the expected background distribution. The two hatched 
histograms, each stacked on top of the background histogram, show the signal 
expectation for the SM (pg = 1) and the observed (ag = 1.26) signal strengths. 
The lower panel shows the ratios of the expected signal and observed results relative 
to the expected background. Figure 6.20b plots the ttH signal strength modifiers, 
Ic, for the various selections and the overall combined result. The SM expectation 
is shown as a dashed vertical line. 
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6.7.4 Combining the Results 
6.7.4.1 Mass of the Observed State 


The mass of the Higgs boson is measured using the two decay channels that give 
the best mass resolutions namely H — yy and H — ZZ) — 4 I. ATLAS and 
CMS have combined their results from Run 1 [53]. The results were obtained from 
a simultaneous fit to the reconstructed invariant mass peaks in the two channels 
and for the two experiments. The measured masses from the individual channels 
and the two experiments were found to be consistent amongst themselves. The 
combined measured mass of the Higgs boson was found to be my = 125.09 + 0.21 
(stat.) + 0.11 (syst. GeV, a value subsequently used in many of the analyses 
discussed above. The results of these measurements and more recent ones are shown 
in Fig. 6.21 [54]. 

The mass of the Higgs boson, combined with the measured top quark mass, has 
cosmological implications. The current measurement of mg, along with that of the 
top quark mass [m; = 173.21 + 0.51 (stat) +0.71 (syst)] indicate that our universe 
is in a metastable state, which eventually will tunnel through the potential barrier to 
the true vacuum in which space collapses, albeit over a period of time that is many 
orders of magnitude larger than the lifetime of the universe so far. 


ATLAS and CMS + Total Stat. Syst. 


7 TeV, 8 TeV and 13 TeV Tot Stat Syst. 


ATLAS H —yy Run 1 SE 126.02 + 0.51 (+ 0.43 + 0.27) GeV 


CMS H —yyRun1 124.70 + 0.34 (+ 0.91 + 0.15) GeV 


ATLAS H — 4I Run 1 124.51 + 0.52 (+ 0.52 + 0.04) GeV 


CMS H — 41 Run 1 125.59 + 0.45 (+ 0.42 + 0.17) GeV 


ATLAS-CMS yy Run 1 125.07 + 0.29 (+ 0.25 + 0.14) GeV 


ATLAS-CMS 4I Run 1 125.15 + 0.40 (+ 0.37 + 0.15) GeV 


ATLAS H —yy Run 2 125.11 + 0.42 (+ 0.21 + 0.36) GeV 
ATLAS H > 4l Run 2 124.88 + 0.37 (+ 0.37 + 0.05) GeV 


CMS H —4lRun2 125.26 + 0.21 (+ 0.20 + 0.08) GeV 


AY Wl EY oaa da aea as acea] 


118 120 122 124 126 128 130 132 


Fig. 6.21 Summary of the CMS and ATLAS mass measurements in the yy and ZZ) channels in 
Run 1 and Run 2. Particle Data Group [54] 
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6.7.4.2 Compatibility of the Observed State with the SM Higgs Boson 
Hypothesis: Signal Strength 


In the SM, the Higgs boson is a fundamental scalar particle with spin-parity 
P= 0*, and couples to fundamental fermions as m/v and to fundamental 
bosons as my*/v* where v = 246 GeV. Several individual tests of compatibility 
with expectations from the SM have been discussed above. 

Here we discuss the signal strength, u, as determined by the combination of 
results from all channels. ATLAS and CMS combined their data from Run 1 [37] 
from the analysis of five production processes, namely gluon fusion, vector boson 
fusion, and associated production with a W or a Z boson or a pair of top quarks, and 
of the five decay modes H — yy, ZZ“, WW“, bb, and tt. All results are reported 
assuming a value of 125.09 GeV for the Higgs boson mass. The Higgs boson 
production and decay rates measured by the two experiments are combined within 
the context of three generic parameterisations: two based on cross sections and 
branching fractions, and one on ratios of coupling modifiers. Several interpretations 
of the measurements with more model-dependent parameterisations are also given. 
The combined signal yield relative to the SM prediction is measured to be 
1.09 + 0.11. The error is broken down as +0.07 (statistical), +0.04 (experimental 
systematic), +0.03 (theoretical on background) and + 0.07 (theoretical on signal). 

The most recent measured values of u, using the above-mentioned channels, 
are: 


* ATLAS: y = 1.13 + 9 og, using 79.8 fb! at ./s = 13 TeV [55], 
e CMS: u = 1.17 0.1, using 35.9 fb™! at A/s = 13 TeV [56]. 


The error in the measurement from ATLAS has the following breakdown: +0.05 
(statistical), +0.05 (experimental systematic), +0.03 (theoretical on background) 
and (+0.05, —0.04) (theoretical on signal). This should give a flavor of the 
possibilities for extrapolation into the future. 


6.7.4.3 Compatibility of the Observed State with the SM Higgs Boson 
Hypothesis: Couplings 


The 25 products, u; x jf, where i (f) is the production (decay) index can also 
be considered as free parameters. This can be viewed as the measurements of 
cross sections times branching fractions, sigmax BR, by production mechanism and 
decay mode. The results from the ATLAS and CMS have been combined and are 
illustrated in Fig. 6.22 from the Particle Data Group [54] showing compatibility with 
the SM. 

Figure 6.23a, from the ATLAS experiment [55], illustrates the dependence of 
the Higgs boson couplings on mass of the decay particles (p, t, b-quark, W, Z 
and t-quark). The plot is made for reduced coupling strength modifiers, x, using 
kr(mr/v) for fermions and (A/kvy)(mv/v) for vector bosons, where v = 246.22 GeV. 
The couplings modifiers kp and ky are measured assuming no BSM contributions 
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Fig. 6.22 Combined measurements of the products o.BR for the five main production and five 
main decay modes. The hatched combinations require more data for a meaningful confidence 
interval to be provided 
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Fig. 6.23 (a) Reduced coupling strength modifiers kp(mp/v) for fermions (t, b, t, p) and 
(./ky)(my/v) for vector bosons (W, Z) as a function of their masses mp and my, respectively. 
The SM prediction for both cases is also shown (dotted line). (b) Likelihood scan in the M-e plane. 
The best-fit point and, 1c, 20 CL regions are shown, along with the SM prediction 


to the Higgs boson decays, and the SM structure of loop processes. The line is the 
expectation from the SM. 

CMS has made a similar fit, shown in Fig. 6.23b [56], using a phenomenological 
parameterization relating the masses of the fermions and vector bosons to the 
corresponding k modifiers using two parameters, denoted M and e. In such a 
model one can relate the coupling modifiers to M and € as kp = v m$ / MI** for 
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fermions, and ky = v mit /M 1+2€ for vector bosons. The SM expectation, kj = 1, 
is recovered when (M, &) = (v, 0). 

Both Fig. 6.23a and b demonstrate good compatibility with the SM within the 
errors on the measurements. 


6.7.4.4 Compatibility of the Observed State with the SM Higgs Boson 
Hypothesis: Quantum Numbers 


Ascertaining the quantum numbers of the Higgs boson is essential to the understand- 
ing of its nature and its coupling properties. According to the Landau- Yang theorem 
the observations made in the diphoton channel excludes the spin-1 hypothesis and 
restricts possibilities for the boson to have spin-0 or -2. The diphoton decay mode 
also implies that the boson has charge conjugation has C-even. 

To identify the spin-parity of the Higgs boson the production and decay processes 
are examined in several analyses. The angular distributions of the decay particles 
can be used to test various spin hypotheses. 

Much can be gleaned from the decay mode H — ZZ“) — 4 l, where the full 
final state is reconstructed, including the angular variables sensitive to the spin- 
parity, along with a very favourable signal/background ratio. CMS has used the 
information from the five angles (see Fig. 6.24a) and the two dilepton pair masses 
combined to form a discriminant based on the 0* nature of the Higgs boson [57]. 

ATLAS has also tested various JP hypotheses, and in particular 0+ and 07. 

In all scenarios investigated by both CMS [57] and ATLAS [58] experiments, the 
data are compatible with the 0* hypothesis, excluding a pseudoscalar nature at CLS 
levels of 99.8% and 98.0%, respectively. The expected distribution in ATLAS of the 
test statistic for the SM hypothesis (in blue) and several alternative spin and parity 
hypotheses is compared in Fig. 6.24b. The combination of the three decay processes 
allows the exclusion of all considered non-SM hypotheses at amore than 99.9% CL 
in favour of the SM spin-0 hypothesis. 


6.8 Conclusions and Outlook 


In July 2012 the ATLAS and CMS experiments announced the discovery of a Higgs 
boson, confirming the conjecture put forward in the 1960s. Further results from the 
two experiments show that, within the current measurement precision, the Higgs 
boson has the properties predicted by the SM. However, several theories of physics 
beyond the SM (BSM) predict the existence of more than one Higgs boson, and 
one of these would only be subtly different from that predicted in the SM one, with 
signal strengths differing by between 0.5-5%, depending on the model in question, 
indicative of the required level of sensitivity to distinguish it from a SM Higgs 
boson. 
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H > ZZ* > 4I 
NS - 7 TeV,4.5fb 
Ns - 8 TeV, 20.3 fb 


ATLAS H+ ZZ* > 4! 
NS - 7 TeV,4.5 fb 
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0* SM H > WW* > evuv 


H > WW* > evuv 
=-= 2* (kq=2Kg p. «125 GeV) 4s = 8 TeV, 20.3 tb 


Ns = 8 TeV, 20.3 fb 
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Arbitrary normalisation 
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Fig. 6.24 (a) Illustration of the production and decay of a particle X — ZıZ2 — 4 I with the 
two production angles 0. and , shown in the X rest frame and three decay angles 01, 62, and ® 
shown in the Z; and X rest frames, respectively. (b) Examples of distributions of the test statistic 
Cq) defined in for the combination of decay channels left) 0* versus 07 right) 0* versus the spin-2 
model. The observed values are indicated by the vertical solid line and the expected medians by 
the dashed lines. The shaded areas correspond to the integrals of the expected distributions used 
for the rejection of each hypothesis 


In Run 2 (2015-2018) the LHC provided proton-proton collisions at J/s = 13 
TeV with a peak instantaneous luminosity of 2 x 10?* cm7? s-, a factor of two 
beyond the design value. It is intended to operate the accelerator at /s = 14 TeV 
after the next long shutdown (2019—2020) and to integrate a luminosity correspond- 
ing to some 300 fb™! by the end of Run 3 (2021—2024). More precise measurements 
of the properties of the new boson will be made, as well as a more extensive 
exploration of physics beyond the SM, for which many possibilities are conjectured 
including supersymmetry, extra dimensions, unified theories, superstrings etc. 

The results presented in Chap. 6 are still mostly dominated by statistical errors. 
The ATLAS and CMS experiments continually update their results that can be found 
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through their websites quoted in references [36, 37]. Much more data need to be 
collected to enable rigorous testing of the compatibility of the Higgs boson with 
the SM and to get clues to physics lying beyond the SM in case of a significant 
deviation. This is one of the main motivations for the high luminosity LHC project, 
labeled the HL-LHC. 

Europe's topmost priority in particle physics calls for the exploitation of the full 
potential of the LHC, including the high-luminosity upgrade of the accelerator and 
detectors with a view to collecting ten times more data than in the initial design. It is 
planned to increase the instantaneous luminosity of the LHC to 5 x 10?^ cm? s7!, 
and record, by around 2035, an integrated luminosity corresponding to ~3000 fb! 
(ten times larger than the original design value). Such an integrated luminosity also 
requires substantial upgrades of the ATLAS and CMS experiments, now underway, 
to allow a very precise measurement of the properties of the Higgs boson the study 
of its rare decay modes and self-coupling, in addition to the search for physics 
beyond the SM. Many theories beyond the SM make different predictions for the 
properties of one or more Higgs bosons. 

Based on the currently analysed data ATLAS and CMS experiments have 
recently made projections of the attainable sensitivity for such measurements by 
the end of the HL-LHC phase [59, 60]. As around 150 million Higgs bosons will 
be produced a search can also be made for exotic and rare decays of the boson. 
Figures 6.25 and 6.26 show two sets of projections; Scenario 1 (S1) using the 
current theoretical errors or Scenario 2 (S2) where the theoretical errors are halved. 
The extrapolations show the possibility of measuring the individual signal strengths 
with a precision of between 5-10% for an integrated luminosity of 300 fb~!, and a 
few percent for a dataset corresponding to 3000 fb~! per experiment, dominated by 
theoretical erros. The per-production mode signal strength parameters are projected 
to be measurable with uncertainties of between 3—6% for a dataset corresponding to 


300 fb” (13 TeV) 3000 fb (13 TeV) 


CMS wi Run 2 syst uncert (S1) CMS w Run 2 syst uncer. (S1) ATLAS Preliminary Total(S1) 
iir w/YRI8 syst. uncer. (52) hosti —À wi YRIB syst uncer (82) Pr E: 
'ojection from Run 2 data 
Projection — s uncer ony Projection — s ucen oniy rz 1 Total(S2 
Basu = 0 Baa = 0 fs = 14 TeV, 3000 tb (S2) ——4 
Ky Hs Ky HH 
Y 0.03 (Stat); 0.04 (52); 0.06 (51) Y 001 (Stat): 0.02 (52); 0.03 (51) BR. LM —À 
Ky RH Ky H= 
0,03 (Stat); 0.04 (82); 0.05 (81) 0.01 (Stat); 0.02 ($2); 0.03 (81) BR, — 
Kz m Kz H= 
0.03 (Stat); 0.04 (52); 0.05 (81) 0.01 (Stat); 0.02 (S2); 0.02 (81) Bey p 
Kg m Kg i —— 
9 003 (Stat); 0.05 (52): 006 (51) 9 001 (Sta 0.02 (52) 004 (51) [ a 
x —————À Ky be 
t 0.03 (Stat 0.06 (52; 0.08 (51) t 0.01 (Stat 0.03 (52); 006 (51) BR pem 
X, Ky H+ 
506 (Sat; 029 (821; 0/1 (81) 0.02 Sat 094 (S2; 0.08 (51) BR, LLIL—3Àà44 | 
K: I ———a4 Kı þa 
z m— t cu (pini 08 ee db o) BR. (emm 
k È eee TETEE TE TE were) 
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Fig. 6.25 Summary plot from CMS showing the total expected +1o uncertainties in S1 and S2 
on the per-decay mode signal strength parameters for 300 fb-! (left) and 3000 fb^! (centre). 
The statistical-only component of the uncertainty is also shown. (right) Expected uncertainty on 
the branching ratio measurements in ATLAS for the gg, ZZ“), WW, tt, bb, pp and Zy decay 
channels normalized to their SM predictions assuming SM production cross section for scenarios 
S1 (red) and S2 (black) 
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Fig. 6.26 (a) Expected uncertainty on the measurement of cross sections in ATLAS for the ggF, 
VBF, WH, ZH and ttH production modes normalized to their SM predictions assuming SM 
branching fractions for Scenarios S1 (red) and S2 (black). (b) Summary plot from CMS showing 
the total expected +10 uncertainties in S1 and S2 on the cross section measurements for 300 fb! 
(left) and 3000 fb-! (right). The statistical-only component of the uncertainty is also shown 


3000 fb~! per experiment. Of particular note, in view of a future electron-positron 
collider, is the projection for the measurement of the ttH coupling with a precision 
of ~5% per experiment. 

The discovery of a Higgs boson implies the discovery of a fundamental scalar 
field that pervades the universe. Astronomical and astrophysical measurements point 
to the following composition of energy-matter in the universe: ~4% normal matter 
that "shines", ~23% dark matter, and the rest in the form of “dark energy." Dark 
matter is weakly and gravitationally interacting matter with no electromagnetic or 
strong interactions. These are the properties carried by the lightest supersymmetic 
particle. Hence the question: Is dark matter supersymmetric in nature? Fundamental 
scalar fields could well have played a critical role in the conjectured inflation of our 
universe immediately after the Big Bang, and in the recently observed accelerating 
expansion of the universe that, among other measurements, signals the presence of 
dark energy in our universe. 

The discovery of the Higgs boson could turn out to be a portal to physics beyond 
the SM. Physicists at the LHC are eagerly looking forward to further running 
of the LHC, and the HL-LHC, and to establishing the true nature of the Higgs 
boson, to find clues or answers to some of the other fundamental open questions 
in particle physics and cosmology. The exploitation of the LHC is in its infancy, 
having recorded a small fraction of the finally anticipated integrated luminosity, and 
the expectations for other discoveries in the coming decades are high. 
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Chapter 7 ff) 
Relativistic Nucleus-Nucleus Collisions Geek for 
and the QCD Matter Phase Diagram 


Reinhard Stock 


7.1 Introduction 


7.1.1 Overview 


This review will be concerned with our knowledge of extended matter under the 
governance of strong interaction, in short: QCD matter. Strictly speaking, the 
hadrons are representing the first layer of extended QCD architecture. In fact 
we encounter the characteristic phenomena of confinement as distances grow to 
the scale of 1 fm (i.e. hadron size): loss of the chiral symmetry property of the 
elementary QCD Lagrangian via non-perturbative generation of “massive” quark 
and gluon condensates, that replace the bare QCD vacuum [1]. However, given such 
first experiences of transition from short range perturbative QCD phenomena (jet 
physics etc.), toward extended, non perturbative QCD hadron structure, we shall 
proceed here to systems with dimensions far exceeding the force range: matter 
in the interior of heavy nuclei, or in neutron stars, and primordial matter in the 
cosmological era from electro-weak decoupling (107!? s) to hadron formation 
(0.5- 1075 s). This primordial matter, prior to hadronization, should be deconfined in 
its QCD sector, forming a plasma (i.e. color conducting) state of quarks and gluons 
[2]: the Quark Gluon Plasma (QGP). 

In order to recreate matter at the corresponding high energy density in the 
terrestrial laboratory one collides heavy nuclei (also called “heavy ions") at ultrarel- 
ativistic energies. Quantum Chromodynamics predicts [2—4] a phase transformation 
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to occur between deconfined quarks and confined hadrons. At near-zero net 
baryon density (corresponding to big bang conditions) non-perturbative Lattice- 
QCD places this transition at an energy density of about 1 GeV/fm?, and at a 
critical temperature, Tert ^ 170 MeV [4-8] (see the article on Lattice QCD in 
this Volume). The ultimate goal of the physics with ultrarelativistic heavy ions is 
to locate this transition, elaborate its properties, and gain insight into the detailed 
nature of the deconfined QGP phase that should exist above. What is meant by 
the term "ultrarelativistic" is defined by the requirement that the reaction dynamics 
reaches or exceeds the critical density e ~ 1 GeV/fm?. Required beam energies turn 
out [8] to be ./s > 10 GeV, and various experimental programs have been carried 
out or are being prepared at the CERN SPS (up to about 20 GeV), at the BNL RHIC 
collider (up to 200 GeV) and finally reaching up to 5.5 TeV at the LHC of CERN. 

QCD confinement-deconfinement is of course not limited to the domain that 
is relevant to cosmological expansion dynamics, at very small excess of baryon 
over anti-baryon number density and, thus, near zero baryo-chemical potential 
uB. In fact, modern QCD suggests [9-11] a detailed phase diagram of QCD 
matter and its states, in the plane of T and baryo-chemical potential ug. For a 
map of the QCD matter phase diagram we are thus employing the terminology 
of the grand canonical Gibbs ensemble that describes an extended volume V of 
partonic or hadronic matter at temperature T'. In it, total particle number is not 
conserved at relativistic energy, due to particle production-annihilation processes 
occurring at the microscopic level. However, the probability distributions (partition 
functions) describing the relative particle species abundances have to respect the 
presence of certain, to be conserved net quantum numbers (7), notably non-zero 
net baryon number and zero net strangeness and charm. Their global conservation 
is achieved by a thermodynamic trick, adding to the system Lagrangian a so-called 
Lagrange multiplier term, for each of such quantum number conservation tasks. This 
procedure enters a “chemical potential" u; that modifies the partition function via 
an extra term exp (—j;/T) occurring in the phase space integral (see Sect. 7.3 for 
detail). It modifies the canonical “punishment factor" (exp (— E/ T)), where E is the 
total particle energy in vacuum, to arrive at an analogous grand canonical factor for 
the extended medium, of exp (— E/T — uj / T). This concept is of prime importance 
for a description of the state of matter created in heavy ion collisions, where 
net-baryon number (valence quarks) carrying objects are considered—extended 
"fireballs" of QCD matter. The same applies to the matter in the interior of neutron 
stars. The corresponding conservation of net baryon number is introduced into the 
grand canonical statistical model of QCD matter via the “baryo-chemical potential" 
Lp. 

We employ this terminology to draw a phase diagram of QCD matter in Fig. 7.1, 
in the variables T and ug. Note that upg is high at low energies of collisions creating 
a matter fireball. In a head-on collision of two mass 200 nuclei at ys = 15 GeV 
the fireball contains about equal numbers of newly created quark-antiquark pairs 
(of zero net baryon number), and of initial valence quarks. The accommodation of 
the latter, into created hadronic species, thus requires a formidable redistribution 
task of net baryon number, reflecting in a high value of ug. Conversely, at LHC 
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Fig. 7.1 Sketch of the QCD 
matter phase diagram in the 
plane of temperature T and 
baryo-chemical potential ug. 
The parton-hadron phase 
transition line from lattice 
QCD [8-11] ends in a critical 
point E. A cross-over 
transition occurs at smaller 
up. Also shown are the 
points of hadro-chemical 
freeze-out from the grand 
canonical statistical model 


quark gluon plasma 


chemical freeze-out 
A. SIS, AGS 


E W 5P5 (NA49) 
"9 * RHIC 


color super- 
hadrons conductor 


ug [MeV] 


energy (5.5 TeV for Pb+Pb collisions), the initial valence quarks constitute a mere 
596 fraction of the total quark density, correspondingly requiring a small value of 
uB. In the extreme, big bang matter evolves toward hadronization (at T2170 MeV) 
featuring a quark over antiquark density excess of 107° only, resulting in ug ^ 0. 

Note that the limits of existence of the hadronic phase are not only reached by 
temperature increase, to the so-called Hagedorn value Ty (which coincides with 
Terit at ug — 0), but also by density increase to o > (5—10) oo: “cold compression" 
beyond the nuclear matter ground state baryon density oo of about 0.16 B/fm?. 
We are talking about the deep interior sections of neutron stars or about neutron 
star mergers [12-14]. A sketch of the present view of the QCD phase diagram 
[9-11] is given in Fig. 7.1. It is dominated by the parton-hadron phase transition 
line that interpolates smoothly between the extremes of predominant matter heating 
(high T, low ug) and predominant matter compression (T — 0, ug > 1 GeV). 
Onward from the latter conditions, the transition is expected to be of first order [15] 
until the critical point of QCD matter is reached at 200 < ug (E) < 500 MeV. 
The relatively large position uncertainty reflects the preliminary character of Lattice 
QCD calculations at finite ug [9-11]. Onward from the critical point, E, the phase 
transformation at lower upg is a cross-over[1 1]. 

We note, however, that these estimates represent a major recent advance of lattice 
theory which was, for two decades, believed to be restricted to the ug = O situation. 
Onward from the critical point, toward lower upg, the phase transformation should 
acquire the properties of a rapid cross-over [16], thus also including the case of 
primordial cosmological expansion. This would finally rule out former ideas, based 
on the picture of a violent first order "explosive" cosmological hadronization phase 
transition, that might have caused non-homogeneous conditions, prevailing during 
early nucleo-synthesis [17], and fluctuations of global matter distribution density 
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that could have served as seedlings of galactic cluster formation [18]. However, it 
needs to be stressed that the conjectured order of phase transformation, occurring 
along the parton-hadron phase boundary line, has not been unambiguously con- 
firmed by experiment, as of now. 

On the other hand, the position of the QCD phase boundary at low upg has, 
in fact, been located by the hadronization points in the T, jp plane that are also 
illustrated in Fig. 7.1. They are obtained from statistical model analysis [19] of the 
various hadron multiplicities created in nucleus-nucleus collisions, which results 
in a [T, jp] determination at each incident energy, which ranges from SIS via 
AGS and SPS to RHIC energies, i.e. 3 < ./s < 200 GeV. Toward low ug these 
hadronic freeze-out points merge with the lattice QCD parton-hadron coexistence 
line: hadron formation coincides with hadronic species freeze-out. These points 
also indicate the ug domain of the phase diagram which is accessible to relativistic 
nuclear collisions. The domain at ug > 1.5 GeV which is predicted to be in a further 
new phase of QCD featuring color-flavor locking and color superconductivity [20] 
will probably be accessible only to astrophysical observation. 

One may wonder how states and phases of matter in thermodynamical 
equilibrium—as implied by a description in grand canonical variables—can be 
sampled via the dynamical evolution of relativistic nuclear collisions. Employing 
heavy nuclei, A ~% 200, as projectiles/targets or in colliding beams (RHIC, LHC), 
transverse dimensions of the primordial interaction volume do not exceed about 
8 fm, and strong interaction ceases after about 20 fm/c. We shall devote an entire 
later section to the aspects of equilibrium (Sect. 7.2.5) but note, for now, that 
the time and dimension scale of primordial perturbative QCD interaction at the 
microscopic partonic level amounts to subfractions of 1 fm/c, the latter scale, 
however, being representative of non perturbative processes (confinement, "string" 
formation etc.). The A+A fireball size thus exceeds, by far, the elementary non 
perturbative scale. An equilibrium quark gluon plasma represents an extended 
non-perturbative QCD object, and the question whether its relaxation time scale 
can be provided by the expansion time scale of an A+A collision, needs careful 
examination. Reassuringly, however, the hadrons that are supposedly created from 
such a preceding non-perturbative QGP phase at top SPS and RHIC energy, do in 
fact exhibit perfect hadrochemical equilibrium, the derived [T, ug] values [19] thus 
legitimately appearing in the phase diagram, Fig. 7.1. 

In the present review we will order the physics observables to be treated, 
in sequence of their origin from successive stages that characterize the overall 
dynamical evolution of a relativistic nucleus-nucleus collision. In rough outline 
this evolution can be seen to proceed in three major steps. An initial period of 
matter compression and heating occurs in the course of interpenetration of the 
projectile and target baryon density distributions. Inelastic processes occurring at 
the microscopic level convert initial beam longitudinal energy to new internal and 
transverse degrees of freedom, by breaking up the initial baryon structure functions. 
Their partons thus acquire virtual mass, populating transverse phase space in the 
course of inelastic perturbative QCD shower multiplication. This stage should be 
far from thermal equilibrium, initially. However, in step two, inelastic interaction 
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between the two arising parton fields (opposing each other in longitudinal phase 
space) should lead to a pile-up of partonic energy density centered at mid-rapidity 
(the longitudinal coordinate of the overall center of mass). Due to this mutual 
stopping down of the initial target and projectile parton fragmentation showers, and 
from the concurrent decrease of parton virtuality (with decreasing average square 
momentum transfer Q?) there results a slowdown of the time scales governing the 
dynamical evolution. Equilibrium could be approached here, the system “lands” 
on the T, u plane of Fig. 7.1, at temperatures of about 300 and 200 MeV at 
top RHIC and top SPS energy, respectively. The third step, system expansion and 
decay, thus occurs from well above the QCD parton-hadron boundary line. Hadrons 
and hadronic resonances then form, which decouple swiftly from further inelastic 
transmutation so that their yield ratios become stationary ("frozen-out"). A final 
expansion period dilutes the system to a degree such that strong interaction ceases 
all together. 

In order to verify in detail this qualitative overall model, and to ascertain the 
existence (and to study the properties) of the different states of QCD that are popu- 
lated in sequence, one seeks observable physics quantities that convey information 
imprinted during distinct stages of the dynamical evolution, and “freezing-out” 
without significant obliteration by subsequent stages. Ordered in sequence of their 
formation in the course of the dynamics, the most relevant such observables are 
briefly characterized below: 


1. Suppression of J/ V and Y production by Debye-screening in the QGP. These 
vector mesons result from primordial, pQCD production of cc and bb pairs 
that would hadronize unimpeded in elementary collisions but are broken up if 
immersed into a npQCD deconfined QGP, at certain characteristic temperature 
thresholds. 

2. Suppression of dijets which arise from primordial qq pair production fragment- 
ing into partonic showers (jets) in vacuum but being attenuated by QGP-medium 
induced gluonic bremsstrahlung: Jet quenching in A+A collisions. 


a. A variant of this: any primordial hard parton suffers a high, specific loss of 
energy when traversing a deconfined medium: High pr suppression in A-A 
collisions. 


3. Hydrodynamic collective motion develops with the onset of (local) thermal 
equilibrium. It is created by partonic pressure gradients that reflect the initial col- 
lisional impact geometry via non-isotropies in particle emission called “directed” 
and "elliptic" flow. The latter reveals properties of the QGP, seen here as an ideal 
partonic fluid. 


a. Radial hydrodynamical expansion flow (“Hubble expansion") is a variant of 
the above that occurs in central, head on collisions with cylinder symmetry, as 
a consequence of an isentropic expansion. It should be sensitive to the mixed 
phase conditions characteristic of a first order parton-hadron phase transition. 
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4. Hadronic "chemical" freeze-out fixes the abundance ratios of the hadronic 
species into an equilibrium distribution. Occurring very close to, or at hadroniza- 
tion, it reveals the dynamical evolution path in the [T, ug] plane and determines 
the critical temperature and density of QCD. The yield distributions in A-A 
collisions show a dramatic strangeness enhancement effect, characteristic of an 
extended QCD medium. 

5. Fluctuations, from one collision event to another (and even within a single given 
event) can be quantified in A+A collisions due to the high charged hadron 
multiplicity density (of up to 600 per rapidity unit at top RHIC energy). Such 
event-by-event (Debye) fluctuations of pion rapidity density and mean transverse 
momentum (event "temperature"), as well as event-wise fluctuations of the 
strange to non-strange hadron abundance ratio (may) reflect the existence and 
position of the conjectured critical point of QCD (Fig. 7.1). 

6. Two particle Bose-Einstein-Correlations are the analog of the Hanbury-Brown, 
Twiss (HBT) effect of quantum optics. They result from the last interaction 
experienced by hadrons, i.e. from the global decoupling stage. Owing to a near 
isentropic hadronic expansion they reveal information on the overall space-time- 
development of the “fireball” evolution. 


In an overall view the first group of observables (1 to 2a) is anchored in 
established pQCD physics that is well known from theoretical and experimental 
analysis of elementary collisions (e*e^ annihilation, pp and pp data). In fact, the 
first generation of high Q? baryon collisions, occurring at the microscopic level 
in A+A collisions, should closely resemble such processes. However, their primary 
partonic products do not escape into pQCD vacuum but get attenuated by interaction 
with the concurrently developing extended high density medium, thus serving as 
diagnostic tracer probes of that state. The remaining observables capture snapshots 
of the bulk matter medium itself. After initial equilibration we may confront elliptic 
flow data with QCD during the corresponding partonic phase of the dynamical 
evolution employing thermodynamic [21] and hydrodynamic [22] models of a high 
temperature parton plasma. The hydro-model stays applicable well into the hadronic 
phase. Hadron formation (confinement) occurs in between these phases (at about 
5 us time in the cosmological evolution). In fact relativistic nuclear collision data 
may help to finally pin down the mechanism(s) of this fascinating QCD process 
[23-25] as we can vary the conditions of its occurrence, along the parton-hadron 
phase separation line of Fig. 7.1, by proper choice of collisional energy ./s, and 
system size A, while maintaining the overall conditions of an extended imbedding 
medium of high energy density within which various patterns [9—11, 15, 16] of the 
hadronization phase transition may establish. The remaining physics observables 
(3a, 5 and 6 above) essentially provide for auxiliary information about the bulk 
matter system as it traverses (and emerges from) the hadronization stage, with 
special emphasis placed on manifestations of the conjectured critical point. 

The present review will briefly cover each of the above physics observables in a 
separate chapter, beginning with the phenomena of confinement and hadronization 
(Sect. 7.3), then to turn to the preceding primordial dynamics, e.g. to elliptical 
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flow (Sect. 7.4), high pr and jet quenching (Sect. 7.5) and quarkonium suppression 
(Sect. 7.6) as well as in-medium o-meson “melting”. We then turn to the late period, 
with correlation and fluctuation studies (Sect. 7.7). We conclude (Sect. 7.8) with a 
summary, including an outlook to the future of the research field. 

However, before turning to such specific observables we shall continue this 
introductory chapter, with a look at the origin, and earlier development of the ideas 
that have shaped this field of research (Sect. 7.1.2). Then we turn to a detailed 
description of the overall dynamical evolution of relativistic nucleus-nucleus col- 
lisions, and to the typical overall patterns governing the final distributions in 
transverse and longitudinal (rapidity) phase space (Sect. 7.2). The aspects of an 
approach toward equilibrium, at various stages of the dynamical evolution (which 
are of key importance toward the intended elucidation of the QCD matter phase 
diagram), will be considered, in particular. 


7.1.2 History 


The search for the phase diagram of strongly interacting matter arose in the 1960s, 
from a coincidence of ideas developing—at first fairly independently—in nuclear 
and astrophysics. In fact, the nuclear proton-neutron matter, a quantum liquid at 
T = 0 and energy density € = 0.15 GeV/fm?, represents the ground state of 
extended QCD matter. Of course, QCD was unknown during the development 
of traditional nuclear physics, and the extended matter aspects of nuclei—such 
as compressibility or the equation of state, in general—did not receive much 
attention until the advent, in the 1960s, of relativistic nuclear mean field theory, 
notably s-matrix theory by Brueckner [26] and the o-model of Walecka [27]. These 
theories developed the novel view of "infinite nuclear matter" structure, based on 
in-medium properties of the constituent baryons that share parts of their vacuum 
mass and surface structure with the surrounding, continuous field of relativistic 
scalar and vector mesons. Most importantly, in the light of subsequent development, 
these theories allowed for a generalization away from ground state density and 
zero temperature. Such developments turned out to be of key relevance for acute 
nuclear astrophysics problems: the dynamics of type II super-novae and the stability 
of neutron stars, which both required the relation of pressure to density and 
temperature of hadronic matter, i.e. the hadronic matter equation of state (EOS). 
H.A. Bethe et al. [28] postulated that the final stages of supernova collapse should 
evolve through the density interval 0.1 < @/09 < 5 where ọọ = 0.16 (baryons 
per fm?) is the nuclear matter ground state density, and a similar domain was 
expected for the neutron star density variation from surface to interior [29]. It 
was clear that, at the highest thus considered densities the EOS might soften due 
to strange hadron production caused by increasing Fermi energy. However the 
field theoretical models permitted no reliable extrapolation to such high densities 
(which, in retrospect, are perhaps not reached in supernova dynamics [30]), and the 
experimental information concerning the EOS from study of the giant monopole 
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resonance—a collective density oscillation also called “breathing mode"— covered 
only the parabolic minimum occurring in the related function of energy vs. density 
at T = 0, o = oo. 

The situation changed around 1970 due to the prediction made by W. Greiner et 
al. [31] that nucleus-nucleus collisions, at relatively modest relativistic energies, 
would result in shock compression. This mechanism promised to reach matter 
densities far beyond those of a mere superposition (i.e. 9/090 < 2y) of initial 
target and projectile densities. Coinciding in time, the newly developed Bevalac 
accelerator at LBL Berkeley offered projectiles up to *8Ar, at just the required 
energies, 100 MeV < Ejap/nucleon < 2 GeV. The field of “relativistic heavy 
ion physics” was born. The topic was confronted, at first, with experimental 
methods available both from nuclear and particle physics. It was shown that particle 
production (here still restricted to pions and kaons) could indeed be linked to 
the equation of state [32] and that, even more spectacularly, the entire “fireball” 
of mutually stopped hadrons developed decay modes very closely resembling the 
initial predictions of hydrodynamical shock flow modes [33] which directly link 
primordial pressure gradients with collective velocity fields of matter streaming out, 
again governed by the nuclear/hadronic matter EOS. Actually, both these statements 
do, in fact, apply (mutatis mutandis) up to the present ultra-relativistic energies (see 
Sects. 7.2—7.4). However it turned out soon that the equation of state at low or even 
Zero temperature (as required in supernova and neutron star studies) could only be 
obtained in a semi-empirical manner [34]. The reason: compression can, in such 
collisions, be only accomplished along with temperature and entropy increase. In 
an ideal baryon gas exp(s/A) « T?/?/o, i.e. T? will grow faster than ọ in a 
non-isentropic compression. Thus the reaction dynamics will be sensitive to various 
isothermes of the ground state EOS P = f(o, T = 0), staying at T > O, 
throughout, and, moreover, not at constant T. Thus a relativistic dynamical mean 
field model is required in order to interactively deduce the T — 0 EOS from data 
[34]. The EOS result thus remains model dependent. 

The ideas concerning creation of a quark gluon plasma arose almost concurrent 
with the heavy ion shock compression proposal. In 1974 T.D. Lee formulated the 
idea that the non-perturbative vacuum condensates could be “melted down ...by 
distributing high energy or high nucleon density over a relatively large volume" 
[35]. Collins and Perry [36] realized that the asymptotic freedom property of QCD 
implies the existence of an ultra-hot form of matter with deconfined quarks and 
gluons, an idea that gained wide recognition when S. Weinberg [37] proposed an 
asymptotic freedom phase at the beginning of "The first Three minutes". In fact, 
this idea of deconfinement by asymptotic freedom (with implied temperature of 
several GeV) was correct, but somewhat besides the point, as everybody expected, 
likewise, that deconfinement sets in right above the limiting hadron temperature of 
R. Hagedorn [38], Ty ~ 160 MeV. A medium existing down to that temperature 
would, however, feature an average momentum square transfer Q? < 1 GeV’, ie. 
be far into the non perturbative domain, and very far from asymptotic freedom. 
Right above the hadron to parton transition the “quark gluon plasma” (as it was 
named by E. Shuryak [39]) is not a weakly coupled ideal pQCD gas as soon became 


7 Relativistic Nucleus-Nucleus Collisions and the QCD Matter Phase Diagram 319 


obvious by Lattice QCD calculations for extended matter [40]. Seen in retrospect 
one obviously cannot defend a picture of point like quarks (with "current" masses) 
at Q? < 0.2 GeV? where size scales of 0.5 to 1 fm must play a dominating role. 

An analytic QCD description of deconfinement does not exist. For heavy 
quarkonia, cc (J / V) and bb (Y) deconfinement in partonic matter, Matsui and Satz 
proposed [41] a Debye screening mechanism, caused by the high spatial density of 
free color carriers, that removes the confining long range potential as T increases 
toward about 2 T,, an effect reproduced by modern lattice QCD [42]. However, light 
hadron deconfinement cannot be understood with a non-relativistic potential model. 
Such critical remarks not withstanding, we shall demonstrate in Sects. 7.3—7.6 that 
the very existence, and also crucial properties of the QGP can in fact be inferred 
from experiment, and be confronted with corresponding predictions of recent lattice 
QCD theory. 

Our present level of an initial understanding of the phase diagram of QCD matter 
(Fig. 7.1), is the result of a steady development of both experiment and theory, that 
began about three decades ago, deriving initial momentum from the Bevalac physics 
at LBL which motivated—along with the developing formulation of the quark gluon 
plasma research goals—a succession of experimental facilities progressing toward 
higher ./s. Beginning with the AGS at BNL (Si and !?"Au beams with /s < 
5 GeV), the next steps were taken at the CERN SPS (V/s from 6 to 20 GeV; 160, 
325, 208Pb beams), and at the Relativistic Heavy Ion Collider RHIC (the first facility 
constructed explicitly for nuclear collisions) which offers beams of Cu and 1% Au 
at 20 < ./s < 200 GeV. A final, gigantic step in energy will be taken 2008 with the 
CERN Large Hadron Collider: ?°8Pb beams at /s = 5.5 TeV. 


7.2 Bulk Hadron Production in A+A Collisions 


In this section we take an overall look at bulk hadron production in nucleus- 
nucleus collisions. In view of the high total c.m. energies involved at e.g. top SPS 
(E x 3.3 TeV) and top RHIC (38 TeV) energies, in central Pb+Pb (SPS) and 
Au+Au (RHIC) collisions, one can expect an extraordinarily high spatial density of 
produced particles. Thus, as an overall idea of analysis, one will try to relate the 
observed flow of energy into transverse and longitudinal phase space and particle 
species to the high energy density contained in the primordial interaction volume, 
thus to infer about its contained matter. The typical experimental patterns of such 
collisions, both in collider mode at RHIC and in a fixed target configuration at the 
SPS, are illustrated in Fig. 7.2 which shows a fractional view of the total distribution 
of charged particles (about 4000 and 1600, respectively) within the tracking volume 
of the STAR and NA49 experiments. 

Most of these tracks correspond to “thermal” pions (pr up to 2 GeV) and, in 
general, such thermal hadrons make up for about 95% of the observed multiplicity: 
the bulk of hadron production. Their distributions in phase space will be illustrated 
in the subsections below. This will lead to a first insight into the overall reaction 
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Fig. 7.2 Charged particle tracks in central Au+Au and Pb+Pb collision events, in collider 
geometry (top) from RHIC STAR TPC tracking at ./s = 200 GeV, and in fixed target geometry 
(bottom) from NA49 at the SPS, ./s = 17.3 GeV 


dynamics, and also set the stage for consideration of the rare signals, imbedded in 
this thermal bulk production: correlations, jets, heavy flavors, fluctuations, which 
are the subject of later chapters. 


7.2.1 Particle Multiplicity and Transverse Energy Density 


Particle production can be assessed globally by the total created transverse energy, 
the overall result of the collisional creation of transverse momentum pr or 


transverse mass (mp = ,/ p? + mô), at the microscopic level. Figure 7.3 shows 
the distribution of total transverse energy Er = »; E(0j) - sin O resulting from a 


calorimetric measurement of energy flow into calorimeter cells centered at angle ©; 
relative to the beam [43], for ??S + !?7 Au collisions at A/s = 20 GeV, and for 208pp 
+ 208pp collisions at /s = 17.3 GeV. 

The shape is characteristic of the impact parameter probability distribution (for 
equal size spheres in the Pb+Pb case). The turnoff at Er = 520 GeV indicates 
the point where geometry runs out of steam, i.e. where b — 0, a configuration 
generally referred to as a “central collision". The adjacent shoulder results from 
genuine event by event fluctuations of the actual number of participant nucleons 
from target and projectile (recall the diffuse Woods-Saxon nuclear density profiles), 
and from experimental factors like calorimeter resolution and limited acceptance. 
The latter covers 1.3 units of pseudo-rapidity and contains mid-rapidity nmia = 2.9. 
Re-normalizing [43] to A 7n = 1 leads to dET/dr (mid) = 400 GeV, in agreement 
with the corresponding WA80 result [44]. Also, the total transverse energy of central 
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Pb+Pb collisions at ./s = 17.3 GeV turns out to be about 1.2 TeV. As the definition 
of a central collision, indicated in Fig.7.3, can be shown [42] to correspond to 
an average nucleon participant number of Npart = 370 one finds an average total 
transverse energy per nucleon pair, of ET/ (0.5 Npart) = 6.5 GeV. After proper 
consideration of the baryon pair rest mass (not contained in the calorimetric ET 
response but in the corresponding ./s) one concludes [43] that the observed total 
Er corresponds to about 0.6 EIS the maximal Er derived from a situation of 
“complete stopping" in which the incident ./s gets fully transformed into internal 
excitation of a single, ideal isotropic fireball located at mid-rapidity. The remaining 
fraction of Ey“ thus stays in longitudinal motion, reflecting the onset, at SPS 
energy, of a transition from a central fireball to a longitudinally extended "fire-tube", 
i.e. a cylindrical volume of high primordial energy density. In the limit of much 
higher ./s one may extrapolate to the idealization of a boost invariant primordial 
interaction volume, introduced by Bjorken [45]. 

We shall show below (Sect. 7.2.2) that the charged particle rapidity distributions, 
from top SPS to top RHIC energies, do in fact substantiate a development toward a 
boost-invariant situation. One may thus employ the Bjorken model for an estimate 
of the primordial spatial energy density e, related to the energy density in rapidity 
space via the relation [45] 


1 1 dEr 


— — — 7.1 
xR? tj dy Ub 


€(10) = 
where the initially produced collision volume is considered as a cylinder of length 
dz = ro dy and transverse radius R œ A!/. Inserting for 2 R? the longitudinally 
projected overlap area of Pb nuclei colliding near head-on ("centrally"), and 
assuming that the evolution of primordial pQCD shower multiplication (i.e. the 
energy transformation into internal degrees of freedom) proceeds at a time scale 
to < 1 fm/c, the above average transverse energy density, of dEr/dy = 400 GeV 
at top SPS energy [43, 44] leads to the estimate 


€(t = 1 fm) 2 3.0 0.6 GeV/fm’, (7.2) 
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Fig. 7.4 Charged hadron 
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thus exceeding, by far, the estimate of the critical energy density eo obtained from 
lattice QCD (see below), of about 1.0 GeV/fm?. Increasing the collision energy 
to ./s = 200 GeV for Au+Au at RHIC, and keeping the same formation time, 
to = 1 fm/c (a conservative estimate as we shall show in Sect. 7.2.4), the Bjorken 
estimate grows to € © 6.0 + 1 GeV/fm?. This statement is based on the increase 
of charged particle multiplicity density at mid-rapidity with 4/s, as illustrated in 
Fig. 7.4. From top SPS to top RHIC energy [46] the density per participant nucleon 
pair almost doubles. However, at ./s = 200 GeV the formation or thermalization 
time ro, employed in the Bjorken model [45], was argued [47] to be shorter by a 
factor of about 4. We will return to such estimates of ro in Sect. 7.2.5 but note, for 
now, that the above choice of tọ = 1 fm/c represents a conservative upper limit at 
RHIC energy. 

These Bjorken-estimates of spatial transverse energy density are confronted in 
Fig. 7.5 with lattice QCD results obtained for three dynamical light quark flavors 
[48], and for zero baryo-chemical potential (as is realistic for RHIC energy and 
beyond but still remains a fair approximation at top SPS energy where ug ~ 
250 MeV). The energy density of an ideal, relativistic parton gas scales with the 


7 Relativistic Nucleus-Nucleus Collisions and the QCD Matter Phase Diagram 323 


fourth power of the temperature, 
e — gT^ (7.3) 


where g is related to the number of degrees of freedom. For an ideal gluon gas, 
g = 16 12/30; in an interacting system the effective g is smaller. The results of 
Fig. 7.5 show, in fact, that the Stefan-Boltzmann limit esp is not reached, due to non 
perturbative effects, even at four times the critical temperature Te = 170 MeV. The 
density e/ T^ = g is seen to ascend steeply, within the interval T. + 25 MeV. At T; 
the critical QCD energy density € = 0.6-1.0 GeV/fm?. Relating the thermal energy 
density with the Bjorken estimates discussed above, one arrives at an estimate of the 
initial temperatures reached in nucleus-nucleus collisions, thus implying thermal 
partonic equilibrium to be accomplished at time scale tọ (see Sect. 7.2.5). For 
the SPS, RHIC and LHC energy domains this gives an initial temperature in the 
range 190 « TSPS < 220 MeV, 220 « TRHIC < 400 MeV (assuming [47] 
that ry decreases to about 0.3 fm/c here) and ye > 600 MeV, respectively. 
From such estimates one tends to conclude that the immediate vicinity of the phase 
transformation is sampled at SPS energy, whereas the dynamical evolution at RHIC 
and LHC energies dives deeply into the “quark-gluon-plasma” domain of QCD. We 
shall return to a more critical discussion of such ascertations in Sect. 7.2.5. 

One further aspect of the mid-rapidity charged particle densities per participant 
pair requires attention: the comparison with data from elementary collisions. 
Figure 7.4 shows a compilation of pp, pp and e*e~ data covering the range from 
ISR to LEP and Tevatron energies. 

The data from ete~ represent d Nn /dy, the rapidity density along the event 
thrust axis, calculated assuming the pion mass [49] (the difference between dN /dy 
and dN /dņ can be ignored here). Remarkably, they superimpose with the central 
A+A collision data, whereas pp and pp show similar slope but amount to only about 
60% of the AA and e*e™ values. This difference between e*e~ annihilation to 
hadrons, and pp or pp hadro-production has been ascribed [50] to the characteristic 
leading particle effect of minimum bias hadron-hadron collisions which is absent 
in ete. It thus appears to be reduced in AA collisions due to subsequent 
interaction of the leading parton with the oncoming thickness of the remaining 
target/projectile density distribution. This naturally leads to the scaling of total 
particle production with Npart that is illustrated in Fig. 7.6, for three RHIC energies 
and minimum bias Au+Au collisions; the close agreement with e*e- annihilation 
data is obvious again. One might conclude that, analogously, the participating 
nucleons get “annihilated” at high 4/s, their net quantum number content being 
spread out over phase space (as we shall show in the next section). 
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7.2.2 Rapidity Distributions 


Particle production number in A+A collisions depends globally on 4/s and collision 
centrality, and differentially on pr and rapidity y, for each particle species i. 
Integrating over pr results in the rapidity distribution dN;/dy. Particle rapidity,! 


y = sinh"! p /Mr (where Mz = ,/m?2 + py. requires mass identification. If that 
is unknown one employs pseudo-rapidity (7 = — In [tan(O /2)]) instead. This is 
also chosen if the joint rapidity distribution of several unresolved particle species 
is considered: notably the charged hadron distribution. We show two examples 
in Fig. 7.7. The left panel illustrates charged particle production in pp collisions 
studied by UAI at ./s = 540 GeV [51]. Whereas the minimum bias distribution 
(dots) exhibits the required symmetry about the center of mass coordinate, 7 = 
0, the rapidity distribution corresponding to events in which a W boson was 
produced (histogram) features, both, a higher average charged particle yield, and an 
asymmetric shape. The former effect can be seen to reflect the expectation that the 
W production rate increases with the “centrality” of pp collisions, involving more 
primordial partons as the collisional overlap of the partonic density profiles gets 
larger, thus also increasing the overall, softer hadro-production rate. The asymmetry 
should result from a detector bias favoring W identification at negative rapidity: 
the transverse W energy, of about 100 GeV would locally deplete the energy store 


'The rapidity variable represents a compact (logarithmic) description of longitudinal phase space. 
It is based on longitudinal particle velocity (derived from pjong and m), y = 1/21n((1 + 8L)/(1 — 
BL)). The rapidity distribution dN /dy is shape invariant under longitudinal Lorentz transformation, 
and centered at “mid-rapidity” ymia = ycm, for all produced particle species; see Figs. 7.7, 7.8, 7.9, 
and 7.10. 
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Fig. 7.7 Left panel: charged particle pseudo-rapidity distribution in pp collisions at /s = 
540 GeV [51]. Right panel: same in RHIC Au-Au collisions at ./s = 130 GeV at different 
centralities [52]. Closed lines represent fits with the color glass condensate model [64] 


available for associated soft production. If correct, this interpretation suggests that 
the wide rapidity gap between target and projectile, arising at such high Js, of width 
Ay © 2 In (2ycm), makes it possible to define local sub-intervals of rapidity within 
which the species composition of produced particles varies. 

The right panel of Fig. 7.7 shows charged particle pseudo-rapidity density dis- 
tributions for Au Au collisions at ./s = 130 GeV measured by RHIC experiment 
PHOBOS [52] at three different collision centralities, from "central" (the 696 highest 
charged particle multiplicity events) to semi-peripheral (the corresponding 35—45% 
cut). We will turn to centrality selection in more detail below. Let us first remark 
that the slight dip at mid-rapidity and, moreover, the distribution shape in general, 
are common to pp and Au-Au. This is also the case for ete~ annihilation as is 
shown in Fig. 7.8 which compares the ALEPH rapidity distribution along the mean 
pr (“thrust”) axis of jet production in e* e^ at /s = 200 GeV [49] with the scaled 
PHOBOS-RHIC distribution of central Au+Au at the same ./s [53]. Note that the 
mid-rapidity values contained in Figs. 7.7 and 7.8 have been employed already in 
Fig. 7.4, which showed the overall ./s dependence of mid-rapidity charged particle 
production. What we concluded there was a perfect scaling of A+A with e*e^ 
data at ./s > 20 GeV and a 40% suppression of the corresponding pp, pp 
yields. We see here that this observation holds, semi-quantitatively, for the entire 
rapidity distributions. These are not ideally boost invariant at the energies considered 
here but one sees in dNen/dy a relatively smooth “plateau” region extending over 
| y |< 1.52.5. 

The production spectrum of charged hadrons is, by far, dominated by soft pions 
(pr < 1 GeV/c) which contribute about 85% of the total yield, both in elementary 
and nuclear collisions. The evolution of the x~ rapidity distribution with /s is 
illustrated in Fig. 7.9 for central Au+Au and Pb+Pb collisions from AGS via SPS to 
RHIC energy, 2.7 < ./s < 200 GeV [54]. 
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Fig. 7.8 Pseudo-rapidity 
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At lower ./s the distributions are well described by single Gaussian fits [54] with 
c (y) nearly linearly proportional to the total rapidity gap Ay c In./s as shown in 
the right hand panel of Fig. 7.9. Also illustrated is the prediction of the schematic 
hydrodynamical model proposed by Landau [55], 


c? « In ( ve ) (7.4) 


2m, 


which pictures hadron production in high ys pp collisions to proceed via a 
dynamics of initial complete "stopping down" of the reactants matter/energy 
content in a mid-rapidity fireball that would then expand via 1-dimensional ideal 
hydrodynamics. Remarkably, this model that has always been considered a wildly 
extremal proposal falls rather close to the lower 4/s data for central A+A collisions 
but, as longitudinal phase space widens approaching boost invariance we expect that 
the (non-Gaussian) width of the rapidity distribution grows linearly with the rapidity 
gap Ay. LHC data will finally confirm this expectation, but Figs. 7.7, 7.8, and 7.9 
clearly show the advent of boost invariance, already at ./s = 200 GeV. 

A short didactic aside: At low ,/s the total rapidity gap Ay = 2-3 does closely 
resemble the total rapidity width obtained for a thermal pion velocity distribution at 
temperature T = 120-150 MeV, of a single mid-rapidity fireball, the y-distribution 
of which represents the longitudinal component according to the relation [19] 


dN ? 2mT? 27 
— «[m*T + — + -a exp[—m cosh y/ T] (7.5) 
dy cosh y cosh y 
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Fig. 7.9 Left panel: negative pion rapidity distributions in central Au+Au and Pb+Pb collisions 
from AGS via SPS to RHIC energies [54]. Right panel: the Gaussian rapidity width of pions versus 
A/s, confronted by Landau model predictions (solid line) [54] 


where m is the pion mass. Any model of preferentially longitudinal expansion of 
the pion emitting source, away from a trivial single central "completely stopped" 
fireball, can be significantly tested only once Ay > 3 which occurs upward from 
SPS energy. The agreement of the Landau model prediction with the data in Fig. 7.9 
is thus fortuitous, below ./s œ% 10 GeV, as any created fireball occupies the entire 
rapidity gap with pions. 

The Landau model offers an extreme view of the mechanism of “stopping”, 
by which the initial longitudinal energy of the projectile partons or nucleons 
is inelastically transferred to produced particles and redistributed in transverse 
and longitudinal phase space, of which we saw the total transverse fraction in 
Fig. 7.3. Obviously ete annihilation to hadrons represents the extreme stopping 
situation. Hadronic and nuclear collisions offer the possibility to analyze the final 
distribution in phase space of their non-zero net quantum numbers, notably net 
baryon number. Figure 7.10 shows the net-proton rapidity distribution (i.e. the 
proton rapidity distribution subtracted by the antiproton distribution) for central 
Pb+Pb/Au+Au collisions at AGS (Vs = 5.5 GeV), SPS (./s < 17.3 GeV) and 
RHIC (./s = 200 GeV) [56]. With increasing energy we see a central (but non- 
Gaussian) peak developing into a double-hump structure that widens toward RHIC 
leaving a plateau about mid-rapidity. The RHIC-BRAHMS experiment acceptance 
for p, p identification does unfortunately not reach up to the beam fragmentation 
domain at yp = 5.4 (nor does any other RHIC experiment) but only to y ~ 3.2, 
with the consequence that the major fraction of p"* is not accounted for. However 
the mid-rapidity region is by no means net baryon free. At SPS energy the NA49 
acceptance covers the major part of the total rapidity gap, and we observe in 
detail a net p distribution shifted down from y, = 2.9 by an average rapidity 
shift [56] of (Sy) = 1.7. From Fig. 7.10 we infer that (dy) cannot scale linearly 
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with yp © In(2ycw) © In J/s for ever—as it does up to top SPS energy where 
(Sy) = 0.58 y, [56]. Because extrapolating this relation to /s = 200 GeV would 
result in (dy) = 3.1, and with yp ~ 5.4 at this energy we would expect to observe a 
major fraction of net proton yield in the vicinity of y — 2.3 which is not the case. A 
saturation must thus occur in the (dy) vs. ./s dependence. 

The re-distribution of net baryon density over longitudinal phase space is, of 
course, only partially captured by the net proton yield but a recent study [57] has 
shown that proper inclusion of neutron? and hyperon production data at SPS and 
RHIC energy scales up, of course, the dN /dy distributions of Fig. 7.10 but leaves 
the peculiarities of their shapes essentially unchanged. As the net baryon rapidity 
density distribution should resemble the final valence quark distribution the Landau 
model is ruled out as the valence quarks are seen to be streaming from their initial 
position at beam rapidity toward mid-rapidity (not vice versa). It is remarkable, 
however, to see that some fraction gets transported very far, during the primordial 
partonic non-equilibrium phase. We shall turn to its theoretical description in 
Sect. 7.2.4 but note, for now, that pp collisions studied at the CERN ISR [58] lead 
to a qualitatively similar net baryon rapidity distribution, albeit characterized by a 
smaller (8y). 


?Neutrons are not directly measured in the SPS and RHIC experiments but their production rate, 
relative to protons, reflects in the ratio of tritium to ?He production measured by NA49 [57], 
applying the isospin mirror symmetry of the corresponding nuclear wave functions. 
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The data described above suggest that the stopping mechanism universally 
resides in the primordial, first generation of collisions at the microscopic level. The 
rapidity distributions of charged particle multiplicity, transverse energy and valence 
quark exhibit qualitatively similar shapes (which also evolve similarly with /s) 
in pp, pp, e*e- reactions, on the one hand, and in central or semi-peripheral 
collisions of A 7 200 nuclei, on the other. Comparing in detail we formulate a 
nuclear modification factor for the bulk hadron rapidity distributions, 


RAA daN®/dy (y) in A+A we 
" 0.5 Npat dN®/dy in 7-0) 
part y pp 


where Npart < 2A is the mean number of “participating nucleons” (which undergo 
at least one inelastic collision with another nucleon) which increases with collision 
centrality. For identical nuclei colliding (NS) & (oat) and thus 0.5 Npart gives 
the number of opposing nucleon pairs. RA“ = 1 if each such “opposing” pair 
contributes the same fraction to the total A+A yield as is produced in minimum bias 
pp at similar ./s. From Figs. 7.4 and 7.6 we infer that for | n |< 1, RAA = 1.5 at top 
RHIC energy, and for the pseudo-rapidity integrated total N“ we find RA“ = 1.36, 
in central Au+Au collisions. AA collisions thus provide for a higher stopping power 
than pp (which is also reflected in the higher rapidity shift (Sy) of Fig. 7.10). The 
observation that their stopping power resembles the ee~ inelasticity suggests a 
substantially reduced leading particle effect in central collisions of heavy nuclei. 
This might not be surprising. In a Glauber-view of successive minimum bias nucleon 
collisions occurring during interpenetration, each participating nucleon is struck 
v > 3 times on average, which might saturate the possible inelasticity, removing 
the leading fragment. 

This view naturally leads to the scaling of the total particle production in nuclear 
collisions with Npart, as seen clearly in Fig.7.6, reminiscent of the “wounded 
nucleon model" [59] but with the scaling factor determined by e*e7 rather than pp 
[60]. Overall we conclude from the still rather close similarity between nuclear and 
elementary collisions that the mechanisms of longitudinal phase space population 
occur primordially, during interpenetration which is over after 0.15 fm/c at RHIC, 
and after 1.5 fm/c at SPS energy. Le. it is the primordial non-equilibrium pQCD 
shower evolution that accounts for stopping, and its time extent should be a lower 
limit to the formation time tọ employed in the Bjorken model [45], Eq. (7.1). 
Equilibration at the partonic level might begin at t > ro only (the development 
toward a quark-gluon-plasma phase), but the primordial parton redistribution 
processes set the stage for this phase, and control the relaxation time scales involved 
in equilibration [61]. More about this in Sect. 7.2.5. We infer the existence of a 
saturation scale [62] controlling the total inelasticity: with ever higher reactant 
thickness, proportional to A!/?, one does not get a total rapidity or energy density 
proportional to A^? (the number of “successive binary collisions") but to A! 99 only 
[63]. Note that the lines shown in Fig. 7.7 (right panel) refer to such a saturation 
theory: the color glass condensate (CGC) model [64] developed by McLerran and 
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Venugopulan. The success of these models demonstrates that “successive binary 
baryon scattering" is not an appropriate picture at high ,/s. One can free the partons 
from the nucleonic parton density distributions only once, and their corresponding 
transverse areal density sets the stage for the ensuing QCD parton shower evolution 
[62]. Moreover, an additional saturation effect appears to modify this evolution at 
high transverse areal parton density (see Sect. 7.2.4). 


7.2.3 Dependence on System Size 


We have discussed above a first attempt toward a variable (Npart) that scales the 
system size dependence in A+A collisions. Note that one can vary the size either 
by centrally colliding a sequence of nuclei, Ay + A1, A2 + A» etc., or by selecting 
different windows in Npart out of minimum bias collision ensembles obtained for 
heavy nuclei for which BNL employs !?" Au and CERN ?9Pb, The third alternative, 
scattering a relatively light projectile, such as ??S, from increasing A nuclear targets, 
has been employed initially both at the AGS and SPS but got disfavored in view 
of numerous disadvantages, of both experimental (the need to measure the entire 
rapidity distribution, i.e. lab momenta from about 0.3-100 GeV/c, with uniform 
efficiency) and theoretical nature (different density distributions of projectile and 
target; occurrence of an "effective" center of mass, different for hard and soft 
collisions, and depending on impact parameter). 

The determination of Npart is of central interest, and thus we need to look at 
technicalities, briefly. The approximate linear scaling with Npar that we observed in 
the total transverse energy and the total charged particle number (Figs. 7.3 and 7.6) 
is a reflection of the primordial redistribution of partons and energy. Whereas all 
observable properties that refer to the system evolution at later times, which are of 
interest as potential signals from the equilibrium, QCD plasma “matter” phase, have 
different specific dependences on Npart, be it suppressions (high pr signals, jets, 
quarkonia production) or enhancements (collective hydrodynamic flow, strangeness 
production). Npart thus emerges as a suitable common reference scale. 

Npart captures the number of potentially directly hit nucleons. It is estimated from 
an eikonal straight trajectory Glauber model as applied to the overlap region arising, 
in dependence of impact parameter b, from the superposition along beam direction 
of the two initial Woods-Saxon density distributions of the interacting nuclei. To 
account for the dilute surfaces of these distributions (within which the intersecting 
nucleons might not find an interaction partner) each incident nucleon trajectory gets 
equipped with a transverse radius that represents the total inelastic NN cross section 
at the corresponding 4/s. The formalism is imbedded into a Monte Carlo simulation 
(for detail see [66]) starting from random microscopic nucleon positions within 
the transversely projected initial Woods-Saxon density profiles. Overlapping cross 
sectional tubes of target and projectile nucleons are counted as a participant nucleon 
pair. Owing to the statistics of nucleon initial position sampling each considered 
impact parameter geometry thus results in a probability distribution of derived Npart. 
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Its width o defines the resolution A(b) of impact parameter b determination within 
this scheme via the relation 


2k c (b) 5 d (Npart(b)) 


A(b) db aD 


which, at A = 200, leads to the expectation to determine b with about 1.5 fm 
resolution [66], by measuring Npar. 

How to measure Npart? In fixed target experiments one can calorimetrically 
count all particles with beam momentum per nucleon and superimposed Fermi 
momentum distributions of nucleons, i.e. one looks for particles in the beam 
fragmentation domain ypeam + 0.5, pr < 0.25 GeV/c. These are identified as 


spectator nucleons, and NER —A-— Nipel- For identical nuclear collision systems 


part part /> part * 
employed in the CERN experiments NA49 and WASO, and generalized [67] in a 
way that is illustrated in Fig. 7.11. 

The top panel shows the minimum bias distribution of total energy registered in 
a forward calorimeter that covers the beam fragment domain in Pb+Pb collisions 
at lab. energy of 158 GeV per projectile nucleon, ./s = 17.3 GeV. The energy 
spectrum extends from about 3 TeV which corresponds to about 20 projectile 
spectators (indicating a "central" collision), to about 32 TeV which is close to the 
total beam energy and thus corresponds to extremely peripheral collisions. Note 
that the shape of this forward energy spectrum is the mirror image of the minimum 
bias transverse energy distribution of Fig. 7.3, both recorded by NA49. From both 
figures we see that the ideal head-on, b — O collision cannot be selected from these 
(or any other) data, owing to the facts that b = 0 carries zero geometrical weight, 


(N 3 E = ( N, E and thus Npar gets approximated by 2N Proj This scheme was 
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Fig. 7.11 (a) Energy spectrum of the forward calorimeter in Pb+Pb collisions at 158A GeV; (b) 
impact parameter and fraction of total inelastic cross section related to forward energy from the 
VENUS model [67, 68] 
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and that the diffuse Woods-Saxon nuclear density profiles lead to a fluctuation of 
participant nucleon number at given finite b. Thus the Npar fluctuation at finite 
weight impact parameters overshadows the genuinely small contribution of near 
zero impact parameters. Selecting "central" collisions, either by an on-line trigger 
cut on minimal forward energy or maximal total transverse energy or charged 
particle rapidity density, or by corresponding off-line selection, one thus faces a 
compromise between event statistics and selectivity for impact parameters near 
zero. In the example of Fig. 7.11 these considerations suggest a cut at about 8 TeV 
which selects the 5% most inelastic events, from among the overall minimum bias 
distribution, then to be labeled as "central" collisions. This selection corresponds to 
a soft cutoff at b < 3 fm. 

The selectivity of this, or of other less stringent cuts on collision centrality is 
then established by comparison to a Glauber or cascade model. The bottom panel of 
Fig. 7.11 employs the VENUS hadron/string cascade model [68] which starts from a 
Monte Carlo position sampling of the nucleons imbedded in Woods-Saxon nuclear 
density profiles but (unlike in a Glauber scheme with straight trajectory overlap 
projection) following the cascade of inelastic hadron/string multiplication, again by 
Monte Carlo sampling. It reproduces the forward energy data reasonably well and 
one can thus read off the average impact parameter and participant nucleon number 
corresponding to any desired cut on the percent fraction of the total minimum bias 
cross section. Moreover, it is clear that this procedure can also be based on the total 
minimum bias transverse energy distribution, Fig. 7.3, which is the mirror image 
of the forward energy distribution in Fig. 7.11, or on the total, and even the mid- 
rapidity charged particle density (Fig. 7.6). The latter method is employed by the 
RHIC experiments STAR and PHENIX. 

How well this machinery works is illustrated in Fig. 7.12 by RHIC-PHOBOS 
results at ./s = 200 GeV [52]. The charged particle pseudo-rapidity density 
distributions are shown for central (3-696 highest Nen cut) Cu+Cu collisions, with 
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Fig. 7.13 Charged pion = 
multiplicity normalized by NA49, vs w = 17.3 GeV 
Nw vs. centrality in p+p, L 
C+C, Si+Si and Pb+Pb 
collisions at ./s = 17.3 GeV 
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(Npart) = 100, and semi-peripheral Au+Au collisions selecting the cut window (35- 
40%) such that the same (Noart) emerges. The distributions are nearly identical. 
In extrapolation to Npat = 2 one would expect to find agreement between min. 
bias p+p, and "super-peripheral" A+A collisions, at least at high energy where 
the nuclear Fermi momentum plays no large role. Figure 7.13 shows that this 
expectation is correct [69]. As it is technically difficult to select Npat = 2 from 
A — 200 nuclei colliding, NA49 fragmented the incident SPS Pb beam to study 
12C+!?C and ?5Si4?Si collisions [67]. These systems are isospin symmetric, and 
Fig. 7.13 thus plots 0.5 (x+) + (x7)/ (Nw) including p+p where Nw = 2 by 
definition. We see that the pion multiplicity of A+A collisions interpolates to the 
p+p data point. 

Note that NA49 employs the term “wounded nucleon” number (Nw) to count 
the nucleons that underwent at least one inelastic nucleon-nucleon collision. This is 
what the RHIC experiments (that follow a Glauber model) call Npart whereas NA49 
reserves this term for nucleons that underwent any inelastic collision. Thus Nw in 
Fig. 7.13 has the same definition as Npart in Figs. 7.4, 7.6, 7.8, and 7.12. We see 
that a smooth increase joins the p+p data, via the light A+A central collisions, to 
a saturation setting in with semi-peripheral Pb+Pb collisions, the overall, relative 
increase amounting to about 40% (as we saw in Fig. 7.4). 

There is nothing like an Ny increase (the thickness of the reactants) observed 
here, pointing to the saturation mechanism(s) mentioned in the previous section, 
which are seen from Fig. 7.13 to dampen the initial, fast increase once the primordial 
interaction volume contains about 80 nucleons. In the Glauber model view of 
successive collisions (to which we attach only symbolical significance at high V/s) 
this volume corresponds to (v) ~ 3, and within the terminology of such models we 
might thus argue, intuitively, that the initial geometrical cross section, attached to 
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the nucleon structure function as a whole, has disappeared at (v) 7 3, all constituent 
partons being freed. 


7.2.4 Gluon Saturation in A+A Collisions 


We will now take a closer look at the saturation phenomena of high energy QCD 
scattering, and apply results obtained for deep inelastic electron-proton reactions to 
nuclear collisions, a procedure that relies on a universality of high energy hadron 
scattering. This arises at high ./s, and at relatively low momentum transfer squared 
Q? (the condition governing bulk charged particle production near mid-rapidity at 
RHIC, where Feynman x ~ 0.01 and Q? < 5 GeV’). Universality comes about 
as the transverse resolution becomes higher and higher, with Q?, so that within the 
small area tested by the collision there is no difference whether the partons sampled 
there belong to the transverse gluon and quark density projection of any hadron 
species, or even of a nucleus. And saturation arises once the areal transverse parton 
density exceeds the resolution, leading to interfering QCD sub-amplitudes that do 
not reflect in the total cross section in a manner similar to the mere summation of 
separate, resolved color charges [61—65, 70, 71]. 

The ideas of saturation and universality are motivated by HERA deep inelastic 
scattering (DIS) data [72] on the gluon distribution function shown in Fig. 7.14 
(left side). The gluon rapidity density, xG(x, Q?) = (dN®™°")/(dy) rises rapidly 
as a function of decreasing fractional momentum, x, or increasing resolution, Q?. 
The origin of this rise in the gluon density is, ultimately, the non-abelian nature of 
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Fig. 7.14 (Left) HERA data for the gluon distribution function as a function of fractional 
momentum x and square momentum transfer Q? [72]. (Right) Saturation of gluons in a hadron; a 
head on view as x decreases [75] 
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QCD. Due to the intrinsic non-linearity of QCD [70, 71], gluon showers generate 
more gluon showers, producing an avalanche toward small x. As a consequence 
of this exponential growth the spatial density of gluons (per unit transverse area 
per unit rapidity) of any hadron or nucleus must increase as x decreases [65]. 
This follows because the transverse size, as seen via the total cross section, rises 
more slowly toward higher energy than the number of gluons. This is illustrated 
in Fig. 7.14 (right side). In a head-on view of a hadronic projectile more and more 
partons (mostly gluons) appear as x decreases. This picture reflects a representation 
of the hadron in the "infinite momentum frame" where it has a large light-cone 
longitudinal momentum P* >> M. In this frame one can describe the hadron wave 
function as a collection of constituents carrying a fraction p* = x P*, 0 €x x <1, 
of the total longitudinal momentum [73] (“light cone quantization" method [74 ]). In 
DIS at large sqrts and Q? one measures the quark distributions dNq/dx at small x, 
deriving from this the gluon distributions xG(x, Q?) of Fig. 7.14. 

It is useful [75] to consider the rapidity distribution implied by the parton 
distributions, in this picture. Defining y = yhaaron — In(1/x) as the rapidity of the 
potentially struck parton, the invariant rapidity distribution results as 


dN/dy = x dN/dx = xG(x, Q”). (7.8) 


At high Q? the measured quark and gluon structure functions are thus simply related 
to the number of partons per unit rapidity, resolved in the hadronic wave function. 
The above textbook level [74, 75] recapitulation leads, however, to an important 
application: the dN /dy distribution of constituent partons of a hadron (or nucleus), 
determined by the DIS experiments, is similar to the rapidity distribution of 
produced particles in hadron-hadron or A+A collisions as we expect the initial 
gluon rapidity density to be represented in the finally observed, produced hadrons, 
at high v/s. Due to the longitudinal boost invariance of the rapidity distribution, we 
can apply the above conclusions to hadron-hadron or A+A collisions at high /s, 
by replacing the infinite momentum frame hadron rapidity by the center of mass 
frame projectile rapidity, Yproj while retaining the result that the rapidity density of 
potentially interacting partons grows with increasing distance from yp;oj like 


Ay = yproj — y = In(1/x). (7.9) 


At RHIC energy, /s = 200 GeV, Ay at mid-rapidity thus corresponds to x < 107? 
(well into the domain of growing structure function gluon density, Fig. 7.14), and 
the two intersecting partonic transverse density distributions thus attempt to resolve 
each other given the densely packed situation that is depicted in the lower circle 
of Fig. 7.14 (right panel). At given Q? (which is modest, Q? « 5 GeV?, for bulk 
hadron production at mid-rapidity) the packing density at mid-rapidity will increase 
toward higher ./s as 


Aymidrap ~ In(/s/M), ie. 1/x  /s/M (7.10) 
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thus sampling smaller x domains in Fig. 7.14 according to Eq. (7.9). It will further 
increase in proceeding from hadronic to nuclear reaction partners A+A. Will it be 
in proportion to A*/3? We know from the previous sections (Sects. 7.2.2 and 7.2.3) 
that this is not the case, the data indicating an increase with A109, This observation 
is, in fact caused by the parton saturation effect, to which we turn now. 

For given transverse resolution Q? and increasing 1/x the parton density of 
Fig. 7.14 becomes so large that one cannot neglect their mutual interactions any 
longer. One expects such interactions to produce “shadowing”, a decrease of 
the scattering cross section relative to incoherent independent scattering [70, 71]. 
As an effect of such shadowed interactions there occurs [75] a saturation [61— 
65, 70, 71, 75] of the cross section at each given Q?, slowing the increase with 1/x 
to become logarithmic once 1/x exceeds a certain critical value x,(Q7). Conversely, 
for fixed x, saturation occurs for transverse momenta below some critical Q? (x), 


1 dN 


— — 7.11 
mR? dy (11) 


O3 (x) = o, Ne 


where dN /dy is the x-dependent gluon density (at y = Yproj —In(1/x)). Q? is called 
the saturation scale. In Eq. (7.11) z R? is the hadron area (in transverse projection), 
and aN, is the color charge squared of a single gluon. More intuitively, Q2(x) 
defines an inversely proportional resolution area F;(x) and at each x we have to 
choose 7, (x) such that the ratio of total area z R? to F; (x) (the number of resolved 
areal pixels) equals the number of single gluon charge sources featured by the total 
hadron area. As a consequence the saturation scale Q? (x) defines a critical areal 
resolution, with two different types of QCD scattering theory defined, at each x, for 
Q? > Q? and Q? < Q?, respectively [62, 65, 75]. 

As one expects a soft transition between such theories, to occur along the 
transition line implied by Q2 (x), the two types of QCD scattering are best studied 
with processes featuring typical Q? well above, or below Q2(x). Jet production at 
/s > 200 GeV in pp or AA collisions with typical Q? above about 10? GeV?, 
clearly falls into the former class, to be described e.g. by QCD DGLAP evolution 
of partonic showers [76]. The acronym DGLAP refers to the inventors of the 
perturbative QCD evolution of parton scattering with the "running" strong coupling 
constant o, (Q?), Dokshitzer, Gribov, Levine, Altarelli and Parisi. On the other 
hand, mid-rapidity bulk hadron production at the upcoming CERN LHC facility 
(VS = 14 TeV for pp, and 5.5 TeV for A+A), with typical Q? < 5 GeV? at 
x < 1073, will present a clear case for QCD saturation physics, as formulated 
e.g. in the “Color Glass Condensate (CGC)” formalism developed by McLerran, 
Venugopalan and collaborators [64, 65, 75, 77]. This model develops a classical 
gluon field theory for the limiting case of a high areal occupation number density, 
i.e. for the conceivable limit of the situation depicted in Fig. 7.14 (right hand panel) 
where the amalgamating small x gluons would overlap completely, within any 
finite resolution area at modest Q?. Classical field theory captures, by construction, 
the effects of color charge coherence, absent in DGLAP parton cascade evolution 
theories [75]. This model appears to work well already at ./s as “low” as at 
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RHIC, as far as small Q? bulk charged particle production is concerned. We have 
illustrated this by the CGC model fits [64] to the PHOBOS charged particle rapidity 
distributions, shown in Fig. 7.7. 

Conversely, QCD processes falling in the transition region between such limiting 
conditions, such that typical Q? ~ Q2(x), should present observables that are 
functions of the ratio between the transferred momentum Q? and the appropriate 
saturation scale, expressed by Q2(x). As Q? defines the effective transverse 
sampling area, and Q? (x) the characteristic areal size at which saturation is expected 
to set in, a characteristic behavior of cross sections, namely that they are universal 
functions of Q?/ D. is called “geometric scaling". The HERA ep scattering data 
obey this scaling law closely [78], and the idea arises to apply the universality 
principle that we mentioned above: at small enough x, all hadrons or nuclei are 
similar, their specific properties only coming in via the appropriate saturation scales 
Qi, h) or Q? (x, A). Knowing the latter for RHIC conditions we will understand 
the systematics of charged particle production illustrated in the previous chapter, and 
thus also be able to extrapolate toward LHC conditions in pp and AA collisions. 

All data for the virtual photo-absorption cross section oa"? (x, Q?) in deep 
inelastic ep scattering with x < 0.01 (which is also the RHIC mid-rapidity x- 
domain) have been found [78] to lie on a single curve when plotted against Q? / Q?, 
with 

a) ~ er 1 GeV? (7.12) 
with à ~ 0.3 and xo œ 107^. This scaling [79] with c = Q?/Q? is shown 
in Fig. 7.15 (top panel) to interpolate all data. A chain of arguments, proposed 
by Armesto et al. [63] connects a fit to these data with photo-absorption data for 
(virtual) photon-A interactions [80] via the geometrical scaling ansatz 

A p = 

c Y^ (rA) _ oYP (ry = Ta) (7.13) 

zRÀ xR 
assuming that the scale in the nucleus grows with the ratio of the transverse parton 
densities, raised to the power 1/ô (a free parameter), 


1/8 1/8 
Ar R? z RÀ 
OSa = Qip ( J ; TA= Th ( A (7.14) 


2 2 
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Figure 7.15 (middle and bottom panels) shows their fit to the nuclear photo- 
absorption data which fixes 6 = 0.79 and xR? = 1.57 fm? (see ref. [63] for detail). 
The essential step in transforming these findings to the case of A+A collisions is 
then taken by the empirical ansatz 


AA 
(aty c 0) x O74 GO RÀ (7.15) 


dy 
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Fig. 7.15 (Top) Geometric 
scaling of the virtual 
photo-absorption cross 
section 0’? on protons; 
(middle) cross sections for 
nuclei normalized according 
to Eq. (7.13); (bottom) the 
ratio of o^ to a fit of o? 
(see [63] for data reference) 
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by which the mid-rapidity parton (gluon) density dN /dy in Eq. (7.11) gets related 
to the charged particle mid-rapidity density at y ~ 0 [70, 81], measured in nucleus- 
nucleus collisions. Replacing, further, the total nucleon number 2A in a collision of 
identical nuclei of mass A by the number Npart of participating nucleons, the final 
result is [63] 


1 dN^^ 


Npart dy 


(at y & 0) = No(V5)* Noan (7.16) 


where the exponent a = (1 — 8)/36 = 0.089, and No = 0.47. The exponent o is 
far smaller than 1/3, a value that represents the thickness of the reactants, and would 
be our naive guess in a picture of “successive” independent nucleon participant 
collisions, whose average number (v) œ (Npart/ 2)// 3 The observational fact (see 
Fig. 7.13) that œ < 1/3 for mid-rapidity low Q? bulk hadron production in A+A 
collisions illustrates the importance of the QCD saturation effect. This is shown 
[63] in Fig. 7.16 where Eq. (7.16) is applied to the RHIC PHOBOS data for mid- 
rapidity charged particle rapidity density per participant pair, in Au+Au collisions at 
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Fig. 7.16 Saturation model fit [63] applied to RHIC charged hadron multiplicity data at mid- 
rapidity normalized by number of participant pairs, at various energies [82]. Also shown is an 
extrapolation to pp data and a prediction for minimum bias Pb+Pb collisions at LHC energy, 


J5 = 5500 GeV 


A/s = 19.6, 130 and 200 GeV [82], also including a prediction for LHC energy. Note 
that the factorization of energy and centrality dependence, implied by the RHIC data 
[52], is well captured by Eq. (7.11) and the resulting fits in Fig. 7.16. Furthermore, 
the steeper slope, predicted for Npart < 60 (not covered by the employed data set), 
interpolates to the corresponding pp and pp data, at Npart = 2. It resembles the 
pattern observed in the NA49 data (Fig. 7.13) for small Npart collisions of light 
A+A systems, at ./s = 17-20 GeV, and may be seen, to reflect the onset of QCD 
saturation. Finally we note that the conclusions of the above, partially heuristic 
approach [63], represented by Eqs. (7.13)-(7.16), have been backed up by the CGC 
theory of McLerran and Venugopulan [64, 65, 75], predictions of which we have 
illustrated in Fig. 7.7. 

Bulk hadron production in AA collisions at high ./s can be related, via the 
assumption of universality of high energy QCD scattering, to the phenomenon of 
geometric scaling first observed in HERA deep inelastic ep cross sections. The 
underlying feature is a QCD saturation effect arising from the diverging areal 
parton density, as confronted with the limited areal resolution Q?, inherent in 
the considered scattering process. The "saturation scale" 0?0c A) captures the 
condition that a single partonic charge source within the transverse partonic density 
profile can just be resolved by a sufficiently high Q?. Bulk hadron production in 
A+A collisions falls below this scale. 


7.2.5 Transverse Phase Space: Equilibrium and the QGP State 


At RHIC energy, ./s = 200 GeV, the Au+Au collision reactants are longitudinally 
contracted discs. At a nuclear radius R ~ A!/? fm and Lorentz y ~ 100 their 
primordial interpenetration phase ends at time tọ < 0.15 fm/c. This time scale is 
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absent in e* e^ annihilation at similar ./s where tọ ^ 0.1 fm/c marks the end 
of the primordial pQCD partonic shower evolution [83] during which the initially 
created qq pair, of “virtually” Q = /s/2 each, multiplies in the course of the 
QCD DGLAP evolution in perturbative vacuum, giving rise to daughter partons of 
far lower virtuality, of a few GeV. In A+A collisions this shower era should last 
longer, due to the interpenetrational spread of primordial collision time. It should 
be over by about 0.25 fm/c. The shower partons in e*e^ annihilation are localized 
within back to back cone geometry reflecting the directions of the primordial quark 
pair. The eventually observed "jet" signal, created by an initial Q? of 10* GeV’, is 
established by then. Upon a slow-down of the dynamical evolution time scale to t ~ 
1 fm/c the shower partons fragment further, acquiring transverse momentum and yet 
lower virtuality, then to enter a non perturbative QCD phase of color neutralization 
during which hadron-like singlet parton clusters are formed. Their net initial pQCD 
virtuality, in pQCD vacuum, is recast in terms of non-perturbative vacuum hadron 
mass. The evolution ends with on-shell, observed jet-hadrons after about 3 fm/c of 
overall reaction time. 

Remarkably, even in this, somehow most elementary process of QCD evolution, 
an aspect of equilibrium formation is observed, not in the narrowly focused final 
dijet momentum topology but in the relative production rates of the various created 
hadronic species. This so-called “hadrochemical” equilibrium among the hadronic 
species is documented in Fig. 7.17. The hadron multiplicities per e* e^ annihilation 
event at /s = 91.2 GeV [38] are confronted with a Hagedorn [38] canonical 
statistical Gibbs ensemble prediction [84] which reveals that the apparent species 
equilibrium was fixed at a temperature of T = 165 MeV, which turns out to be the 
universal hadronization temperature of all elementary and nuclear collisions at high 
A/s (Hagedorns limiting temperature of the hadronic phase of matter). We shall 
return to this topic in Sect. 7.3 but note, for now, that reactions with as few as 20 
charged particles exhibit such statistical equilibrium properties, a pre-requisite for 
application of thermodynamic or hydrodynamic concepts. 

What happens with parton (and hadron) dynamics in A+A collisions after to? 
There will not be a QCD evolution in vacuum (which would be over after 3 fm/c) 
as the transverse radius of the interacting system is large. It may grow to about 
twice the nuclear radius, i.e. to about 15 fm before interactions cease; i.e. the system 
needs about 15 fm/c to decouple. This simple fact is the key to our expectation that 
the expansive evolution of the initial high energy density deposited in a cylinder 
of considerable diameter (about 10 fm), may create certain equilibrium properties 
that allow us to treat the contained particles and energy in terms of thermodynamic 
phases of matter, such as a partonic QGP liquid, or a hadronic liquid or gas, etc. 
Such that the expansion dynamics makes contact to the phase diagram illustrated in 
Fig. 7.1. This expectation turns out to be justified as we shall describe in Sects. 7.3 
and 7.4. What results for the evolution after to in a central A+A collision is sketched 
in Fig. 7.18 by means of a schematic 2-dimensional light cone diagram, which is 
entered by the two reactant nuclei along z = tf trajectories where z is the beam 
direction and Lorentz contraction has been taken to an extreme, such that there 
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Fig. 7.17 Hadron multiplicities in LEP et e^ annihilation at /s = 91.2 GeV confronted with the 
predictions of the canonical statistical hadronization model [84] 
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Fig. 7.18 Schematic light cone diagram of the evolution of a high energy heavy ion collision, 
indicating a formation phase Tg (see text) 
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occurs an idealized t = z = 0 interaction "point". Toward positive t the light cone 
proper time profiles of progressing parton-hadron matter evolution are illustrated. 
The first profile illustrated here corresponds to the end of formation time to. From 
our above discussion of the e*e^ annihilation process one obtains a first estimate, 
To > 0.25 fm/c (including interpenetration time of 0.15 fm/c at RHIC) which 
refers to processes of very high Q? > 10? GeV?, far above the saturation scale 
Q? discussed in the previous section. The latter scale has to be taken into account 
for low pr hadron production. 

It is the specific resolution scale Q? of a QCD sub-process, as enveloped in 
the overall collision dynamics of two slabs of given transverse partonic structure 
function density, that determines which fraction of the constituent partons enters 
interaction. In the simple case of extremely high Q? processes the answer is that 
all constituents are resolved. However, at modest Q? (dominating bulk hadron 
production) the characteristic QCD saturation scale Q2(x) gains prominence, 
defined such that processes with Q? < Q? do not exploit the initial transverse 
parton densities at the level of independent single constituent color field sources 
(see Eq. (7.11)). For such processes the proper formation time scale, to, is of order 
of the inverse saturation momentum [61], 1/ Q; ~ 0.2 fm/c at ./s = 200 GeV. The 
first profile of the time evolution, sketched in Fig. 7.18, should correspond to proper 
time t = to = 0.25 fm/c at RHIC energy. At top SPS energy, ./s = 17.3 GeV, we 
cannot refer to such detailed QCD considerations. A pragmatic approach suggests 
to take the interpenetration time, at y ^ 8.5, for guidance concerning the formation 
time, which thus results as t9 © 1.5 fm/c. 

In summary of the above considerations we assume that the initial partonic 
color sources, as contained in the structure functions (Fig. 7.14), are spread out in 
longitudinal phase space after light cone proper time t = tọ 7 0.2 fm/c, at top 
RHIC energy, and after to ~ 1.5 fm/c at top SPS energy. No significant transverse 
expansion has occurred at this early stage, in a central collision of A ~ 200 
nuclei with transverse diameter of about 12 fm. The Bjorken estimate [45] of initial 
energy density e (Eq. (7.1)) refers to exactly this condition, after formation time 
to. In order to account for the finite longitudinal source size and interpenetration 
time, at RHIC, we finally put the average tọ ^ 0.3 fm, at ./s = 200 GeV, 
indicating the “initialization time" after which all partons that have been resolved 
from the structure functions are engaged in shower multiplication. As is apparent 
from Fig. 7.18, this time scale is Lorentz dilated for partons with a large longitudinal 
momentum, or rapidity. This means that the slow particles are produced first toward 
the center of the collision region, and the fast (large rapidity) particles are produced 
later, away from the collision region. This Bjorken “inside-out” correlation [45] 
between coordinate- and momentum-space is similar to the Hubble expansion 
pattern in cosmology: more distant galaxies have higher outward velocities. This 
means that the matter created in A+A collisions at high ./s is also born expanding, 
however with the difference that the Hubble flow is initially one dimensional 
along the collision axis. This pattern will continue, at ./s = 200 GeV, until the 
system begins to feel the effects of finite size in the transverse direction which 
will occur at some time fg in the vicinity of 1 fm/c. However, the tight correlation 
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between position and momentum initially imprinted on the system will survive all 
further expansive evolution of the initial "firetube", and is well recovered in the 
expansion pattern of the finally released hadrons of modest pr as we shall show 
when discussing radial flow and pion pair Bose-Einstein momentum correlation (see 
Sects. 7.2.6 and 7.7). 

In order to proceed to a more quantitative description of the primordial dynamics 
(that occurs onward from ro for as long the time period of predominantly longitu- 
dinal expansion might extend) we return to the Bjorken estimate of energy density, 
corresponding to this picture [45], as implied by Eq. (7.1), which we now recast as 


dN E 
es (5) (ef) @ RÈ to)! (7.17) 


where the first term is the (average) total hadron multiplicity per unit rapidity 
which, multiplied with the average hadron transverse energy, equals the total 
transverse energy recorded in the calorimetric study shown in Fig. 7.3, as employed 
in Eq. (7.1). The quantity RA is, strictly speaking, not the radius parameter of the 
spherical Woods-Saxon nuclear density profile but the rms of the reactant overlap 
profiles as projected onto the transverse plane (and thus slightly smaller than RA ^ 
A!/3 fm). Employing A!/? here (as is done throughout) leads to a conservative 
estimate of e, a minor concern. However, the basic assumption in Eq. (7.17) is to 
identify the primordial transverse energy "radiation", of an interactional cylindric 
source of radius Ra and length to (where t9 < to < 1 fm/c, not Lorentz dilated at 
midrapidity), with the finally emerging bulk hadronic transverse energy. We justify 
this assumption by the two observations, made above, that 


1. the bulk hadron multiplicity density per unit rapidity (dN})/ (dy) resembles the 
parton density, primordially released at saturation scale to (Figs. 7.7 and 7.16) at 
A/s = 200 GeV, and that 

2. the global emission pattern of bulk hadrons (in rapidity and pr) closely reflects 
the initial correlation between coordinate and momentum space, characteristic of 
a primordial period of a predominantly longitudinal expansion, as implied in the 
Bjorken model. 


Both these observations are surprising, at first sight. The Bjorken model was 
conceived for elementary hadron collisions where the expansion proceeds into 
vacuum, i.e. directly toward observation. Figure 7.18 proposes that, to the contrary, 
primordially produced partons have to transform through further, successive stages 
of partonic and hadronic matter, at decreasing but still substantial energy density, 
in central A+A collisions. The very fact of high energy density, with implied short 
mean free path of the constituent particles, invites a hydrodynamic description of the 
expansive evolution. With initial conditions fixed between to and fo, an ensuing 3- 
dimensional hydrodynamic expansion would preserve the primordial Bjorken-type 
correlation between position and momentum space, up to lower density conditions 
and, thus, close to emission of the eventually observed hadrons. We thus feel 
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justified to employ Eq. (7.1) or (7.17) for the initial conditions at RHIC, obtaining 
[61, 84] 


6 GeV/fm? < e < 20 GeV/fn? (7.18) 


for the interval 0.3 fm/c < to < 1 fm/c, in central Au+Au collisions at y ~ 0 
and ./s = 200 GeV. The energy density at top SPS energy, ./s = 17.3 GeV, can 
similarly be estimated [43, 44] to amount to about 3 GeV/fm? at a tọ of 1 fm/c but 
we cannot identify conditions at Tọ < fo in this case as the mere interpenetration of 
two Pb nuclei takes 1.4 fm/c. Thus the commonly accepted tọ = 1 fm/c may lead 
to a high estimate. An application of the parton-hadron transport model of Ellis and 
Geiger [85, 86] to this collision finds e — 3.3 GeV/fm? at t — 1 fm/c. A primordial 
energy density of about 3 GeV/fm? is 20 times po ~ 0.15 GeV/fm/?, the average 
energy density of ground state nuclear matter, and it also exceeds, by far, the critical 
QCD energy density, of 0.6 < €, < 1 GeV/fm? according to lattice QCD [48]. The 
initial dynamics thus clearly proceeds in a deconfined QCD system also at top SPS 
energy, and similarly so with strikingly higher energy density, at RHIC, where time 
scales below 1 fm/c can be resolved. 

However, in order now to clarify the key question as to whether, and when con- 
ditions of partonic dynamical equilibrium may arise under such initial conditions, 
we need estimates both of the proper relaxation time scale (which will, obviously, 
depend on energy density and related collision frequency), and of the expansion 
time scale as governed by the overall evolution of the collision volume. Only if 
t (relax.) « t (expans.) one may conclude that the “deconfined partonic system" 
can be identified with a “deconfined QGP state of QCD matter" as described e.g. by 
lattice QCD, and implied in the phase diagram of QCD matter suggested in Fig. 7.1. 

For guidance concerning the overall time-order of the system evolution we 
consider information [87] obtained from Bose-Einstein correlation analysis of pion 
pair emission in momentum space (for detail see Sect. 7.7). Note that pions should 
be emitted at any stage of the evolution, after formation time, from the surface 
regions of the evolving "fire-tube". Bulk emission of pions occurs, of course, after 
hadronization (the latest stages illustrated in the evolution sketch given in Fig. 7.18). 
The dynamical pion source expansion models by Heinz [88] and Sinyukov [89] 
elaborate a Gaussian emission time profile, with mean ry (the decoupling time) and 
width Ar (the duration of emission). 

Figure 7.19 shows an application of this analysis to central Pb+Pb collision 
negative pion pair correlation data obtained by NA49 at top SPS energy, ys = 
17.3 GeV [90], where tf © 8 fm/c and At ~ 4 fm/c (note that t = 0 in Fig. 7.19 
corresponds, not to interaction time t = 0 but tot ~ 1.4 fm/c, the end of the 
interpenetration phase). We see, first of all, that the overall dynamical evolution 
of a central Pb+Pb collision at ys = 17.3 GeV is ending at about 15 fm/c; 
the proper time defines the position of the last, decoupling profile illustrated in 
Fig. 7.18, for the SPS collisions considered here. While the details of Fig. 7.19 
will turn out to be relevant to our later discussion of hadronization (Sect. 7.3) and 
hadronic expansion (Sect. 7.4), we are concerned here with the average proper time 
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Fig. 7.19 Time profile of 
pion decoupling rate from the 
fireball in a central Pb+Pb 
collision, with t = 0 the end 
of the formation phase. 
Bose-Einstein correlation of 
7 7 pairs yields an average 
Gaussian decoupling profile 
with tf = 8 fm/c and duration 
of emission parameter 

At = 4 fm/c [87, 88] 


Emission 


at which the partonic phase ends. After consideration of the duration widths of these 
latter expansion phases [86, 87] one arrives at an estimate for the average time, 
spent before hadronization, of At = 3—4 fm/c, again in agreement with the parton 
cascade model mentioned above [86]. This model also leads to the conclusion that 
parton thermal equilibrium is, at least, closely approached locally in these central 
Pb+Pb collisions as far as mid-rapidity hadron production is concerned (at forward- 
backward rapidity the cascade re-scattering processes do not suffice, however). 

This finding agrees with earlier predictions of Trejax = 1-2 fm/c at top SPS 
energy [91]. However we note that all such calculations employ perturbative 
QCD methods, implying the paradoxical consequence that equilibrium is closely 
approached only at the end of the partonic phase, at such low 4/s, i.e. in a QGP 
state at about T = 200 MeV which is, by definition, of non-perturbative nature. We 
shall return to the question of partonic equilibrium attainment at SPS energy in the 
discussion of the hadronization process in nuclear collisions (Sect. 7.3). 

Equilibrium conditions should set in earlier at top RHIC energy. As transverse 
partonic expansion should set in after the proper time interval 0.3 fm/c < fg < 
1 fm/c (which is now resolved by the early dynamics, unlike at top SPS energy), 
we take guidance from the Bjorken estimate of primordial energy density which is 
based on transverse energy production data. Conservatively interpreting the result 
in Eq. (7.18) we conclude that € is about four times higher than at ./s = 17.3 GeV 
in the above proper time interval. As the binary partonic collision frequency scales 
with the square of with the square of the density p (related to the energy density € 
via the relation e = (E) p = Tp), and is inversely proportional to the relaxation time 
Trelax We expect 


Trelax X (1/p)* ~ (T/e)” (7.19) 
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which implies that Trelax (RHIC) © 0.25 trelax (SPS) ~ 0.5 fm/c if we employ the 
estimate T (RHIC) = 2T (SPS). This crude estimate is, however, confirmed by the 
parton transport model of Molar and Gyulassy [92]. 

Partonic equilibration at ./s = 200 GeV should thus set in at a time scale 
commensurate to the (slightly smaller) formation time scale, at which the to be 
participant partons are resolved from the initial nucleon structure functions and 
enter shower multiplication. Extrapolating to the conditions expected at LHC energy 
(Vs = 5.5 TeV for A+A collisions), where the initial parton density of the structure 
functions in Fig. 7.14 is even higher (x ~ 107? at mid-rapidity), and so is the initial 
energy density, we may expect conditions at which the resolved partons are almost 
“born into equilibrium". 

Early dynamical local equilibrium at RHIC is required to understand the observa- 
tions concerning elliptic flow, with which we shall deal, in detail, in Sect. 7.4. This 
term refers to a collective anisotropic azimuthal emission pattern of bulk hadrons 
in semi-peripheral collisions, a hydrodynamical phenomenon that originates from 
the initial geometrical non-isotropy of the primordial interaction zone [93, 94]. A 
detailed hydrodynamic model analysis of the corresponding elliptic flow signal 
at RHIC [95] leads to the conclusion that local equilibrium (a prerequisite to the 
hydrodynamic description) sets in at tg © 0.6 fm/c. This conclusion agrees with the 
estimate via Eq. (7.19) above, based on Bjorken energy density and corresponding 
parton collisions frequency. 

We note that the concept of a hydrodynamic evolution appears to be, almost 
necessarily ingrained in the physics of a system born into (Hubble-type) expansion, 
with a primordial correlation between coordinate and momentum space, and at 
extreme initial parton density at which the partonic mean free path length A is close 
to the overall spatial resolution resulting from the saturation scale, i.e. A ~ 1/Qs. 

The above considerations suggest that a quark-gluon plasma state should be 
created early in the expansion dynamics at ./s = 200 GeV, at about T = 300 MeV, 
that expands hydrodynamically until hadronization is reached, at T œ~ 165- 
170 MeV. Its manifestations will be considered in Sects. 7.3-7.6. At the lower SPS 
energy, up to 17.3 GeV, we can conclude, with some caution, that a deconfined 
hadronic matter system should exist at T ~ 200 MeV, in the closer vicinity of 
the hadronization transition. It may closely resemble the QGP state of lattice QCD, 
near Tp. 


7.2.6 Bulk Hadron Transverse Spectra and Radial Expansion 
Flow 


In this chapter we analyze bulk hadron transverse momentum spectra obtained at 
SPS and RHIC energy, confronting the data with predictions of the hydrodynamical 
model of collective expansion matter flow that we have suggested in the previous 
section, to arise, almost necessarily, from the primordial Hubble-type coupling 


7 Relativistic Nucleus-Nucleus Collisions and the QCD Matter Phase Diagram 347 


between coordinate and momentum space that prevails at the onset of the dynamical 
evolution in A+A collisions at high ys. As all hadronic transverse momentum 
spectra initially follow an approximately exponential fall-off (see below) the bulk 
hadronic output is represented by thermal transverse spectra at pr 2 GeV/c. We 
shall turn to high pr information in later sections. 

Furthermore we shall focus here on mid-rapidity production in near central 
A+A collisions, because hydrodynamic models refer to an initialization period 
characterized by Bjorken-type longitudinal boost invariance, which we have seen 
in Figs.7.7 and 7.9 to be restricted to a relatively narrow interval centered at 
mid-rapidity. Central collisions are selected to exploit the azimuthal symmetry 
of emission, in an ideal impact parameter b — 0 geometry. We thus select the 
predominant, relevant hydrodynamic “radial flow” expansion mode, from among 
other, azimuthally oriented (directed) flow patterns that arise once this cylindrical 
symmetry (with respect to the beam direction) is broken in finite impact parameter 
geometries. 

In order to define, quantitatively, the flow phenomena mentioned above, we 
rewrite the invariant cross section for production of hadron species i in terms of 
transverse momentum, rapidity, impact parameter b and azimuthal emission angle 
Yp (relative to the reaction plane), 


NE E 2 L + 2v! (pr, b) cos qp + 2v) (pr, b) cosQgp) +.. 1 
Pr dpr dy dep 2x pr dpr dy 


(7.20) 


where we have expanded the dependence on oy into a Fourier series. Due to reflec- 
tion symmetry with respect to the reaction plane in collisions of identical nuclei, 
only cosine terms appear. Restricting to mid-rapidity production all odd harmonics 
vanish, in particular the “directed flow" coefficient vi , and we have dropped the y- 
dependence in the flow coefficients vi and v). The latter quantifies the amount of 
"elliptic flow", to which we turn in Sect. 7.4. In the following, we will restrict to 
central collisions which we shall idealize as near-zero impact parameter processes 
governed by cylinder symmetry, whence all azimuthal dependence (expressed by the 
vi, v, ... terms) vanishes, and the invariant cross section reduces to the first term 
in Eq. (7.20), which by definition also corresponds to all measurements in which the 
orientation of the reaction plane is not observed. 

Typical transverse momentum spectra of the latter type are shown in Fig. 7.20, 
for charged hadron production in Au+Au collisions at ./s = 200 GeV, exhibiting 
mid-rapidity data at various collision centralities [97]. We observe a clear-cut 
transition, from bulk hadron emission at pr < 2 GeV/c featuring a near-exponential 
cross section (i.e. a thermal spectrum), to a high pr power-law spectral pattern. 
Within the context of our previous discussion (Sect. 7.2.4) we tentatively identify 
the low pr region with the QCD physics near saturation scale. Hadron production 
at pr — 10 GeV/c should, on the other hand, be the consequence of primordial 
leading parton fragmentation originating from “hard”, high Q? perturbative QCD 
processes. 
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Fig. 7.20 Transverse 
momentum spectra of 
charged hadrons in Au+Au 
collisions at ./s = 200 GeV, 
in dependence of collision 
centrality [97] (offset as 
indicated), featuring 
transition from exponential to 
power law shape 
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We thus identify bulk hadron production at low pr as the emergence of 
the initial parton saturation conditions that give rise to high energy density and 
small equilibration time scale, leading to a hydrodynamical bulk matter expansion 
evolution. Conversely, the initially produced hard partons, from high Q? processes, 
are not thermalized into the bulk but traverse it, as tracers, while being attenuated 
by medium-induced rescattering and gluon radiation, the combined effects being 
reflected in the high pr inclusive hadron yield, and in jet correlations of hadron 
emission. We shall turn to the latter physics observables in Sect. 7.5, while staying 
here with low pr physics, related to hydrodynamical expansion modes, focusing on 
radially symmetric expansion. 

In order to infer from the spectral shapes of the hadronic species about the 
expansion mechanism, we first transform to the transverse mass variable, mp = 
(ph + m2)!72, via 


1 dNi 1 dNj 


— — = — (7.21) 
2x prdprdy  2z my dmr dy 


because it has been shown in p+p collisions [98] near RHIC energy that the mr 
distributions of various hadronic species exhibit a universal pattern (“mr scaling") 
at low mr: 


1 dN; 


— — ~ = Ai —miJT 7.22 
D uns i exp(—m7/T) (7.22) 
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with a universal inverse slope parameter T and a species dependent normalization 
factor A. Hagedorn showed [99] that this scaling is characteristic of an adiabatic 
expansion of a fireball at temperature T. We recall that, on the other hand, an ideal 
hydrodynamical expansion is isentropic. 

Figure 7.21 shows the ,/s dependence of the average transverse kinetic energy 
(mi) — mí for pions, kaons and protons observed at mid-rapidity in central 
Au+Au/Pb+Pb collisions [54]. Similarly, the inverse slope parameter T resulting 
from a fit of Eq. (7.22) to K* and K` transverse mass spectra (at pr < 2 GeV/c) 
is shown in Fig. 7.22, both for nuclear and p+p collisions [100]. We see, first of all, 
that mr scaling does not apply in A+A collisions, and that the kaon inverse slope 
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Fig. 7.21 The average transverse kinetic energy (my) — mo for pions, kaons and protons vs. ./5 
in central Au+Au/Pb+Pb collisions [54]. Open symbols represent negative hadrons 
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Fig. 7.22 The inverse slope parameter T of Eq. (7.22) for K* and K~ transverse mass spectra at 
pr « 2 GeV/c and mid-rapidity in central A+A, and in minimum bias p+p collisions [100] 
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parameter, T ~ 230 MeV over the SPS energy regime, cannot be identified with the 
fireball temperature at hadron formation which is Th ~ 165 MeV from Fig. 7.1. The 
latter is seen, however, to be well represented by the p+p spectral data exhibited in 
the left panel of Fig. 7.22. There is, thus, not only thermal energy present in A+A 
transverse expansion, but also hydrodynamical radial flow. 

We note that the indications in Figs. 7.21 and 7.22, of a plateau in both (mr) 
and T, extending over the domain of SPS energies, 6 < Js < 17 GeV, have 
not yet been explained by any fundamental expansive evolution model, including 
hydrodynamics. Within the framework of the latter model, this is a consequence 
of the initialization problem [96] which requires a detailed modeling, both of 
primordial energy density vs. equilibration time scale, and of the appropriate 
partonic matter equation of state (EOS) which relates expansion pressure to energy 
density. At top RHIC energy, this initialization of hydro-flow occurs, both, at a 
time scale tg œ% 0.6 fm/c which is far smaller than the time scale of eventual bulk 
hadronization (t ~ 3 fm/c), and at a primordial energy density far in excess of the 
critical QCD confinement density. After initialization, the partonic plasma phase 
thus dominates the overall expansive evolution, over a time interval far exceeding 
the formation and relaxation time scale. 

Thus, at RHIC energy, parton transport [92] and relativistic hydrodynamic 
[95, 96] models establish a well developed expansion mode that survives the 
subsequent stages of hadronization and hadronic expansion. This is reflected in their 
success in describing elliptic flow. On the other hand, the hydrodynamical model 
far overestimates elliptic flow at SPS energy [96] at which, as we have shown in 
Sect. 7.2.5, the initialization period may be not well separated from the confinement 
(hadronization) stage. Thus, whereas the expansion evolution at ./s = 200 GeV 
(occurring at near-zero baryo-chemical potential in Fig.7.1) "races" across the 
parton-hadron phase boundary with fully established flow patterns, near ug — 0 
where lattice QCD predicts the phase transformation to be merely a soft cross-over 
[16], the dynamics at ./s = 10-20 GeV may originate from only slightly above, 
or even at the phase boundary, thus sampling the domain 200 < ug < 500 MeV 
where the equation of state might exhibit a “softest point" [96]. The hydrodynamic 
model thus faces formidable uncertainties regarding initialization at SPS energy. 

The plateau in Figs. 7.21 and 7.22 may be the consequence of the fact that not 
much flow is generated in, or transmitted from the partonic phase, at SPS energies, 
because it is initialized close to the phase boundary [100] where the expected critical 
point [9, 10] (Fig. 7.1), and the corresponding adjacent first order phase transition 
might focus [101] or stall [96] the expansion trajectory, such that the observed radial 
flow stems almost exclusively from the hadronic expansion phase. The SPS plateau, 
which we shall subsequently encounter in other bulk hadron variables (elliptic flow, 
HBT radii) might thus emerge as a consequence of the critical point or, in general, 
of the flatness of the parton-hadron coexistence line. RHIC dynamics, on the other 
hand, originates from far above this line. 

Hadronic expansion is known to proceed isentropically [102]: commensurate 
to expansive volume increase the momentum space volume must decrease, from 
a random isotropic thermal distribution to a restricted momentum orientation 
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preferentially perpendicular to the fireball surface, i.e. radial. The initial thermal 
energy, implied by the hadron formation temperature Ty = 165 MeV, will thus fall 
down to a residual Tp at hadronic decoupling from the flow field (“thermal freeze- 
out") plus a radial transverse kinetic energy term m; (Br)? where m; is the mass of 
the considered hadron species and (£r) the average radial velocity. We thus expect 
[103] for the slope of equation (7.22): 


T = Tg + mi (Br)? , pr <2 GeV/c (7.23) 
and 
1 4 (v 1/2 
Pate (==) (pr mi (1.24) 
1 — (vr) 


the latter expression valid at pr larger than hadron mass scale (T then is the 
“blue-shifted temperature" at decoupling [104] and (vr) the average transverse 
velocity). The assumption that radial flow mostly originates from the hadronic 
expansion phase is underlined by the proportionality of flow energy to hadron mass 
(Eq. (7.23)). 

Figure 7.23 illustrates this proportionality, by a recent compilation [103] of RHIC 
results for central Au+Au collisions at ./s = 200 GeV, and SPS results for central 
Pb+Pb collisions at top SPS energy, ./s = 17.3 GeV. At the latter energy the slope 
parameter of the ® meson is seen to be close to that of the similar mass baryons 
p and A, emphasizing the occurrence of m; scaling as opposed to valence quark 
number scaling that we will encounter in RHIC elliptic flow data [94]. As is obvious 
from Fig. 7.23 the multi-strange hyperons and charmonia exhibit a slope saturation 
which is usually explained [103] as a consequence of their small total cross sections 
of rescattering from other hadrons, leading to an early decoupling from the bulk 
hadron radial flow field, such that (8r) o < (Br) p 
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According to our observations with Eq. (7.23) a hydrodynamical ansatz for 
the transverse mass spectrum of hadrons should thus contain the variables "true 
temperature" Tp at decoupling from the flow field, and its average velocity (fT), 
common to all hadrons. This is the case for the blast wave model [104] developed 
as an approximation to the full hydrodynamic formalism [96], assuming a common 
decoupling or "freeze-out" from flow, for all hadronic species, and a boost-invariant 
longitudinal expansion: 


dN; sh inh 
i mrki qe cA) y 2e (7.25) 
mr dmr dy TE Tp 


where p = tanh~! By. In an extended version of this model a function is included 
that describes the radial profile of the transverse velocity field, Br(r) = Br r/R, 
instead of employing a fixed Br at decoupling [106]. Figure 7.24 flow shows [54] 
the resulting energy dependence of Tp and (Sr), for the same set of data as implied 
already in Figs. 7.21 and 7.22. The “true” decoupling temperature rises steeply at the 
AGS and less so at SPS energy (as does ()), to a value of about 95 MeV at top SPS 
energy, which is considerably lower than the chemical freeze-out temperature, Ty = 
165 MeV, at which the hadronic species relative yield composition of the hadronic 
phase becomes stationary (see Sect. 7.3, and Fig. 7.1). Chemical decoupling thus 
occurs early, near the parton-hadron phase boundary, whereas hadronic radial flow 
ceases after significant further expansion and cooling, whence the surface radial 
velocity (its average value given by (fr) in Fig. 7.24) approaches Br 7 0.65. Both 
data sets again exhibit an indication of saturation, over the interval toward top SPS 
energy: the SPS plateau. This supports our above conjecture that radial flow is, 
predominantly, a consequence of isentropic bulk hadronic expansion in this energy 
domain, which sets in at Ty. At RHIC energy, both parameters exhibit a further rise, 


140 i 0.50 pa ; 
BOF h i FR (B) J 
120 F f i or i 
110 È i omt 4 
wE 4 jw i : 
z gor E . J Sot T H J 
= 80b ; 4 038 à i 7 
nt * J d a 4 
eL oi 4 934 4 
sok * 4 ot 4 
v5 [GeV] v5 [GeV] 


Fig. 7.24 Hadron decoupling temperature Tp, and average radial flow velocity (f) extracted from 
blast wave model (see Eq. (7.25)) fits of mr spectra vs. ./s [54] 
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suggesting that primordial partonic flow begins to contribute significantly to radial 
flow. 


7.3 Hadronization and Hadronic Freeze-Out in A+A 
Collisions 


Within the course of the global expansion of the primordial reaction volume 
the local flow "cells" will hit the parton-hadron phase boundary as their energy 
density approaches egi; ^ 1 GeV/fm?. Hadronization will thus occur, not at 
an instant over the entire interaction volume, but within a finite overall time 
interval [86] that results from the spread of proper time at which individual cells, 
or coherent clusters of such cells (as developed during expansion) arrive at the 
phase boundary. However, irrespective of such a local-temporal occurrence, the 
hadronization process (which is governed by non perturbative QCD at the low 
Q? corresponding to bulk hadronization) universally results in a novel, global 
equilibrium property that concerns the relative abundance of produced hadrons and 
resonances. This so-called “hadrochemical equilibrium state” is directly observable, 
in contrast to the stages of primordial parton equilibration that are only indirectly 
assessed, via dynamical model studies. 

This equilibrium population of species occurs both in elementary and nuclear 
collisions [107]. We have seen in Fig. 7.17 a first illustration, by e*e^ annihilation 
data at ys = 91.2 GeV LEP energy, that are well reproduced by the partition 
functions of the statistical hadronization model (SHM) in its canonical form [84]. 
The derived hadronization temperature, Ty = 165 MeV, turns out to be universal to 
all elementary and nuclear collision processes at ./s > 20 GeV, and it agrees with 
the limiting temperature predicted by Hagedorn [38] to occur in any multi-hadronic 
equilibrium system once the energy density approaches about 0.6 GeV/fm?. Thus, 
the upper limit of hadronic equilibrium density corresponds, closely, to the lower 
limit, ei, = 0.6-1.0 GeV/fm? of partonic equilibrium matter, according to lattice 
QCD [48]. In elementary collisions only about 20 partons or hadrons participate: 
there should be no chance to approach thermodynamic equilibrium of species by 
rescattering cascades, neither in the partonic nor in the hadronic phase. The fact that, 
nevertheless, the hadron formation temperature Ty coincides with the Hagedorn 
limiting temperature and with the QCD confinement temperature, is a consequence 
of the non-perturbative QCD hadronization process itself [85], which “gives birth" 
to hadrons/resonances in canonical equilibrium, at high ./s, as we shall see below. 
This process also governs A+A collisions but, as it occurs here under conditions of 
high energy density extended over considerable volume, the SHM description now 
requires a grand canonical ensemble, with important consequences for production 
of strange hadrons (strangeness enhancement). 

The grand canonical order of hadron/resonance production in central A+A 
collisions, and its characteristic strangeness enhancement shows that a state of 
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extended matter that is quantum mechanically coherent must exist at hadronization 
[87, 88, 107]. Whether or not it also reflects partonic equilibrium properties 
(including flavor equilibrium), that would allow us to claim the direct observation of 
a quark gluon plasma state near Tz, cannot be decided on the basis of this observation 
alone, as the hadronization process somehow generates, by itself, the observed 
hadronic equilibrium. This conclusion, however, is still the subject of controversy 
[107]. 

Two typical examples of grand canonical SHM application are illustrated in 
Figs. 7.25 and 7.26, the first showing total hadron multiplicities in central Pb+Pb 
collisions at ./s = 17.3 GeV by NA49 [100] confronted with SHM predictions by 
Becattini et al. [19]. This plot is similar to Fig. 7.17 in which e*e^ annihilation to 
hadrons is confronted with a SHM prediction derived from the canonical ensemble 
[84]. Central Au+Au collision data at ./s = 200 GeV from several RHIC exper- 
iments are compared to grand canonical model predictions by Braun-Munzinger 
et al. [108] in Fig. 7.26. The key model parameters, Ty and the baryo-chemical 
potential ug result as 159 MeV (160 MeV), and 247 MeV (20 MeV) at /s = 
17.3 (200) GeV, respectively. The universality of the hadronization temperature is 
obvious from comparison of the present values with the results of the canonical 
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Fig. 7.25 Total hadron multiplicities in central Pb+Pb collisions at ./s = 17.3 GeV [100] versus 
prediction of the grand canonical statistical hadronization model [19] 


7 Relativistic Nucleus-Nucleus Collisions and the QCD Matter Phase Diagram 355 
1 
EPP e eee. Via = 200 GN 
: RT ro 
10 E e] — 
g F UNS 
* L o Sm 
102 PHENIX 
E A BRAHMS Os a 
[ —— T= 1605, 4, =20 MeV ES 


UKpAEQKKpAEQOQddso?9k"WNx 


Fig. 7.26 Hadron multiplicity ratios at mid-rapidity in central Au+Au collisions at /s = 200 GeV 
from RHIC experiments STAR, PHENIX and BRAHMS, compared to predictions of the grand 
canonical statistical model [108] 


procedure employed in et e~ annihilation to hadrons at ./s = 91.2 GeV (Fig. 7.17), 
and in canonical SHM fits [109] to p+p collision data at ./s = 27.4 GeV where 
Ty = 159 and 169 MeV, respectively. 

Figures 7.25 and 7.26 illustrate two different approaches employed in grand 
canonical SHM application, the former addressing the values of the hadronic multi- 
plicities as obtained in approximate full 4x acceptance (within limitations implied 
by detector performance), the latter employing a set of multiplicity ratios obtained 
in the vicinity of mid-rapidity as invited, at RHIC energy, by the limited acceptance 
of the STAR and PHENIX experiments. The latter approach is appropriate, clearly, 
in the limit of boost-invariant rapidity distributions where hadron production ratios 
would not depend on the choice of the observational rapidity interval. We have 
shown in Sect. 7.2.2 that such conditions do, in fact, set in at top RHIC energy, as 
referred to in Fig. 7.26. However, at low 4/s the y-distributions are far from boost- 
invariant, and the total rapidity gap Ay may become comparable, in the extreme 
case, to the natural rapidity widths of hadrons emitted in the idealized situation 
of a single, isotropically decaying fireball positioned at mid-rapidity. Its rapidity 
spectra, Eq. (7.5), resemble Gaussians with widths T; ~ 2.35 (T/ m;)!? for hadron 
masses m;. Clearly, the particle ratios (dN; /dy)/(dN;/dy) then depend strongly 
on the position of the rapidity interval dy: away from y = 0 heavy hadrons will 
be strongly suppressed, and particle yields in narrow rapidity intervals are useless 
for a statistical model analysis unless it is known a priori that the radiator is a 
single stationary spherical fireball [110]. This is not the case toward top SPS energy 
(see Fig. 7.10), due to significant primordial longitudinal expansion of the hadron 
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emitting source. Given such conditions, the total multiplicity per collision event 
(the invariant yield divided by the total overall inelastic cross section) should be 
employed in the SHM analysis, as is exemplified in Fig. 7.25. 


7.3.1 Hadronic Freeze-Out from Expansion Flow 


The hadronic multiplicities result from integration of the invariant triple differential 
cross section over py and y. Instrumental, experiment-specific conditions tend to 
result in incomplete pr and/or y acceptances. It is important to ascertain that the 
effects of hydrodynamic transverse and longitudinal flow do not blast a significant 
part of the total hadron yield to outside the acceptance, and that they, more generally, 
do not change the relative hadron yield composition, thus basically affecting the 
SHM analysis. To see that hadronization incorporates only the internal energy in the 
co-moving frame [110], we first assume that hadrochemical freeze-out occurs on a 
sharp hypersurface X, and write the total yield of particle species i as 


d? . 
N; = J X i p" Ëo) fig p) = Í Poy, jE) (7.26) 
X x 
where do is the outward normal vector on the surface, and 
Jye)eg i dtp 20 (p)8(p? — m7) p(exp[p-u(x) —uil/T x1)! (1.27) 


is the grand canonical number current density of species i, u; the chemical potential, 
u(x) the local flow velocity, and g; the degeneracy factor. In thermal equilibrium it 
is given by 
i} (x) = pi Gu" (x) with 
u st _ 4 0 2 2 eT. 
pix) = uu QJ; (x) = E p 20(p )à(p^ — mọ) p-u(x) fi(p-uQo); T; mi) 
= EZ fi(Ep T, wi) = pi(T, ui). (7.28) 


Here Ey is the energy in the local rest frame at point x. The total particle yield of 
species i is therefore 


Ni — pi(T, pi) [Pots rut = pi(T, pi) Vx (u^) (7.29) 


where only the total comoving volume Vy of the freeze-out hypersurface X depends 
on the flow profile u”. V is thus a common total volume factor at hadronization (to 
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be determined separately), and the flow pattern drops out from the yield distribution 
over species in 4;zr acceptance [110]. For nuclear collisions at SPS energies and 
below one thus should perform a SHM analysis of the total, 47r -integrated hadronic 
multiplicities, as was done in Fig. 7.25. 

We note that the derivation above illustrates the termination problem of the 
hydrodynamic description of A+A collisions, the validity of which depends on 
conditions of a short mean free path, A < 1 fm. A precise argumentation suggests 
that two different free paths are relevant here, concerning hadron occupation number 
and hadron spectral freeze-out, respectively. As hadrochemical freeze-out occurs in 
the immediate vicinity of Te (and Ty ^: 160-165 MeV from Figs. 7.25 and 7.26), the 
hadron species distribution stays constant throughout the ensuing hadronic phase, 
i.e. the “chemical” mean free path abruptly becomes infinite at Ty, whereas elastic 
and resonant rescattering may well extend far into the hadronic phase, and so does 
collective pressure and flow. In fact we have seen in Sect. 7.2.6 that the decoupling 
from flow occurs at Tp as low as 90-100 MeV (Fig. 7.24). Thus the hydrodynamic 
evolution of high /s collisions has to be, somehow artificially, stopped at the 
parton-hadron boundary in order to get the correct hadron multiplicities N;, of 
Eqs. (7.26)-(7.29), which then stay frozen-out during the subsequent hadronic 
expansion. 

Equations (7.26)-(7.29) demonstrate the application of the Cooper-Frye pre- 
scription [111] for termination of the hydrodynamic evolution. The hyper-surface X 
describes the space-time location at which individual flow cells arrive at the freeze- 
out conditions, € = e; and T = To, of hadronization. At this point, the resulting 
hadron/resonance spectra (for species 7) are then given by the Cooper-Frye formula 


dN; u dN; 
Gp  dyprdpr 


gi 
=, f fi(p- u(x), x) p: do (x), (7.30) 
2r) Jy 

where p" f; doy is the local flux of particle i with momentum p through the surface 
X. For the phase space distribution f in this formula one takes the local equilibrium 
distribution at hadronic species freeze-out from the grand canonical SHM 


fiCE, x) = [exp{(Ei — ui) / T) E 1! (7.31) 


boosted with the local flow velocity u^(x) to the global reference frame by the 
substitution E — p-u(x). Fixing T = T, (taken e.g. from lattice QCD) the hadron 
multiplicities N; then follow from Eq. (7.29), and one compares to experiment, as 
in Figs. 7.25 and 7.26. In order now to follow the further evolution, throughout the 
hadronic rescattering phase, and to finally compare predictions of Eq. (7.30) to the 
observed flow data as represented by the various Fourier-terms of Eq. (7.20) one has 
to re-initialize (with hadronic EOS) the expansion from X(T.) = 165 MeV) until 
final decoupling [96], at T ~ 100 MeV, thus describing e.g. radial and elliptic flow. 

Alternatively, one might end the hydrodynamic description at T = T, and 
match the thus obtained phase space distribution of Eq. (7.30) to a microscopic 
hadron transport model of the hadronic expansion phase [95, 112]. This procedure 
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is illustrated in Fig. 7.27 by an UrQMD [113] calculation of Bass and Dumitru 
[114] for central Au+Au collisions at top RHIC energy. We select here the results 
concerning the survival of the hadronic multiplicities N; throughout the dynamics 
of the hadronic expansion phase, which we have postulated above, based on the 
equality of the hadronization temperatures, Ty œ% 160 MeV, observed in e*e^ 
annihilation (Fig. 7.17), where no hadronic expansion phase exists, and in central 
collisions of A ~ 200 nuclei (Figs. 7.25 and reffig:Figure26). In fact, Fig. 7.27 
shows that the (Nj) observed at the end of the hadronic cascade evolution agree, 
closely, with the initial (N;) as derived from a Cooper-Frye procedure (Eq. (7.29)) 
directly at hadronization. On the other hand, pr spectra and radial flow observables 
change, drastically, during the hadronic cascade expansion phase. 

The hadronic multiplicity distribution {N;}, arising from the hadronization 
process at high ./s, freezes-out instantaneously also in A+A collisions, and is thus 
preserved throughout the (isentropic) hadronic expansion phase. It is thus directly 
measurable and, moreover, its hadrochemical equilibrium features lend themselves 
to an analysis within the framework of Hagedorn-type statistical, grand canonical 
models. As we shall show below, the outcome of this analysis is contained in a 
[TH, uB] parameter pair that reflects the conditions of QCD matter prevailing at 
hadronization, at each considered „/s. In fact, the [T, 4] points resulting from 
the SHM analysis exhibited in Figs. 7.25 and 7.26 (at ./s = 17.3 and 200 GeV, 
respectively) have been shown in the QCD matter phase diagram of Fig. 7.1 to 
approach, closely, the parton-hadron phase coexistence line predicted by lattice 
QCD. Thus, Ty 7 T. at high ./s: hadrochemical freeze-out occurs in the immediate 
vicinity of QCD hadronization, thus providing for a location of the QCD phase 
boundary. 
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7.3.2 Grand Canonical Strangeness Enhancement 


The statistical model analysis [19, 107, 108] of the hadronization species distribution 
Ni in A+A collisions is based on the grand canonical partition function for species i, 


jV. fe kt dk 
_ Bi f (1.32) 
0 


Eih exp (E(k) — u)/T) € 1 


where E? = + m?, and ui = uBBi + ussi + uil is the total chemical 
potential for baryon number B, strangeness S and isospin 3-component 73. Its role 
in Eq. (7.32) is to enforce, on average over the entire hadron source volume, the 
conservation of these quantum numbers. In fact, making use of overall strangeness 
neutrality (* ^; N; S; = 0) as well as of conserved baryon number (participant Z+N) 
and isospin (participant (N — Z)/Z) one can reduce u; to a single effective potential 
uB. Hadronic freeze-out is thus captured in three parameters, T, V and ug. The 
density of hadron/resonance species i then results as 


n; = —— InZ; (7.33) 


which gives 


a giV F k? dk (7.34) 
i= nj = ——————— E EEEE E . 
(27)? Jo exp((Ei(k) — wi)/T} +1 


We see that the common freeze-out volume parameter is canceled if one 
considers hadron multiplicity ratios, N;/N;, as was done in Fig. 7.26. Integration 
over momentum yields the one-particle function 


xii P EB wie Y ely. Ka (=) exp (7) (1.35) 


where K» is the modified Bessel function. At high T the effects of Bose or Fermi 
statistics (represented by the +1 term in the denominators of Eqs. (7.32) and (7.34)) 
may be ignored, finally leading to the Boltzmann approximation 


VTgi » mi Hi 
Ni = xD m; Ko (=) exp (=) (7.36) 


which is the first term of Eq. (7.35). This approximation is employed throughout 
the SHM analysis. It describes the primary yield of hadron species i, directly at 
hadronization. The abundance of hadronic resonance states is obtained convoluting 
equation (7.34) with a relativistic Breit-Wigner distribution [19]. Finally, the overall 
multiplicity, to be compared to the data, is determined as the sum of the primary 
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multiplicity equation (7.36) and the contributions arising from the unresolved decay 
of heavier hadrons and resonances: 


observed 2 pp may + XBU — i) Nj. (7.37) 
j 


After having exposed the formal gear of grand canonical ensemble analysis we 
note that Eq. (7.36) permits a simple, first orientation concerning the relation of T 
to ug in A+A collisions by considering, e.g., the antiproton to proton production 
ratio. From Eq. (7.36) we infer the simple expression 


N(p)/N(p) = exp(—2up/T). (7.38) 


Taking the mid-rapidity value 0.8 for p/p (from Fig. 7.26) at top RHIC energy, 
and assuming that hadronization occurs directly at the QCD phase boundary, and 
hence T & T, ~ 165 MeV, we get ug œ~ 18 MeV from Eq. (7.38), in close 
agreement with the result, ug = 20 MeV, obtained [108] from the full SHM 
analysis. Equation (7.38) illustrates the role played by upg in the grand canonical 
ensemble. It logarithmically depends on the ratio of newly created quark-antiquark 
pairs (the latter represented by the p yield), to the total number of quarks including 
the net baryon number-carrying valence quarks (represented by the p yield). 

The most outstanding property of the hadronic multiplicities observed in central 
A+A collisions is the enhancement of all strange hadron species, by factors ranging 
from about 2 to 20, as compared to the corresponding production rates in elementary 
hadron-hadron (and e+e~ annihilation) reactions at the same A/s. Le. the nuclear 
collision modifies the relative strangeness output by a "nuclear modification factor", 
qs = Ae /0.5 Npart - NPP , which depends on Js and Npart and features a 
hierarchy with regard to the strangeness number s = 1, 2,3 of the considered 
species, RAA < RS < RAA, These properties are illustrated in Figs. 7.28 
and 7.29. The former shows the ratio of total K* to positive pion multiplicities 
in central Au+Au/Pb+Pb collisions, from lower AGS to top RHIC energies, in 
comparison to corresponding ratios from minimum bias p+p collisions [100]. We 
have chosen this ratio, instead of (K +) /Npart, because it reflects, rather directly, the 
“Wroblewski ratio” of produced strange to non-strange quarks [107], contained in 
the produced hadrons, 


s 2((s) + (5)) us 0.2 inpp 
+ 


7.39 
(m) +(d)  |0.45inAA. id 


s = 


(u) + (d) 


The low value of às in pp (and all other elementary) collisions reflects a quark pop- 
ulation far away from u, d, s flavor equilibrium, indicating strangeness suppression 
[109]. 

The so-called strangeness enhancement property of A+A collisions (obvious 
from Figs. 7.28 and 7.29) is, thus, seen as the removal of strangeness suppression; 
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Fig. 7.28 The ratio of total K * to total x* multiplicity as a function of /s, in central Au+Au and 
Pb+Pb collisions and in p+p minimum bias collisions [100] 


10r 10r 
ef 2f 
g 8 
S gf 
E EMI 
= = 
= af 
= E 
lr Ir 
L pep CHC Si+Si  Pb+Pb ! 
i 1 coil L coil 1 LLLLLLI l L rool L LL LLuul L L LLLLLI 
1 10 10? 1 10 10° 
(Nw) (Nu) 


Fig. 7.29 The nuclear modification factors RIA 23 for hyperon and anti-hyperon production in 
nucleus-nucleus collisions at /s = 17.3 GeV, relative to the p+p reference at the same energy 
scaled by Nw (— Npart). The NA49 data refer to total 4x yields [116]. Closed lines represent the 
inverse strangeness suppression factors from ref. [119], at this energy 
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it is also referred to as a strangeness saturation, in SHM analysis [107, 108], for 
the reason that A, ~ 0.45 corresponds to the grand canonical limit of strangeness 
production, implicit in the analysis illustrated in Figs. 7.25 and 7.26. The average 
RA^ at ./s > 10 GeV thus is about 2.2, both in the data of Fig. 7.28 and in the 
statistical model. It increases (Fig. 7.29) toward about 10 in s = 3 production of Q 
hyperons. 

In order to provide for a first guidance concerning the above facts and 
terminology regarding strangeness production we propose an extremely naive 
argument, based on the empirical fact of a universal hadronization temperature 
(Figs. 7.17, 7.25, and 7.26) at high J/s. Noting that (s) = (5) and (u) ~ (u) ~ 
(d) ~ (d) in a QGP system at ug near zero, and T = 165 MeV, just prior to 
hadronization, A; 7 (s) / (u) © exp((m,, — ms)/T} = 0.45 at pr — O if we take 
current quark masses, m, — m, ^: 130 MeV. Le. the value of A; in A+A collisions 
at high ./s resembles that of a grand canonical QGP at ug — 0, as was indeed 
shown in a 3 flavor lattice QCD calculation [115] at T ~ T.. On the contrary, 
a p+p collision features no QGP but a small fireball volume, at T ~ Te, within 
which local strangeness neutrality, (s) — (s) has to be strictly enforced, implying 
a canonical treatment [109]. In our naive model the exponential penalty factor thus 
contains twice the strangeness quark mass in the exponent, A, in pp collisions 
= exp{2(m, — ms)/T} ~ 0.2, in agreement with the observations concerning 
strangeness suppression, which are thus referred to as canonical suppression. In a 
further extension of our toy model, now ignoring the u, d masses in comparison to 
ms ^: 135 MeV, we can estimate the hierarchy of hyperon enhancement in A+A 
collisions, 


ROA œ NA^ /NP?.0.5 Noart © exp{(—sms+2sms)/T} = 2.2,5.1, 11.6 — (7.40) 


for s — 1, 2, 3, respectively. Figure 7.29 shows that these estimates correspond well 
with the data [116] for R^^ derived in 47 acceptance for A, & and Q as well as 
for their antiparticles, from central Pb+Pb collisions at ./s = 17.3 GeV. The p+p 
reference data, and C+C, Si+Si central collisions (obtained by fragmentation of the 
SPS Pb beam) refer to separate NA49 measurements at the same energy. 

The above, qualitative considerations suggest that the relative strangeness yields 
reflect a transition concerning the fireball volume (that is formed in the course of 
a preceding dynamical evolution) once it enters hadronization. Within the small 
volumes, featured by elementary collisions (see Sect. 7.3.3), phase space is severely 
reduced by the requirement of /ocal quantum number conservation [109, 117] 
including, in particular, local strangeness neutrality. These constraints are seen to 
be removed in A+A collisions, in which extended volumes of high primordial 
energy density are formed. Entering the hadronization stage, after an evolution of 
expansive cooling, these extended volumes will decay to hadrons under conditions 
of global quantum mechanical coherence, resulting in quantum number conser- 
vation occurring, non-locally, and on average over the entire decaying volume. 
This large coherent volume decay mode removes the restrictions, implied by local 
quantum number balancing. In the above, naive model we have thus assumed that 
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the hadronization of an Omega hyperon in A+A collisions faces the phase space 
penalty factor of only three s quarks to be gathered, the corresponding three 5 
quarks being taken care of elsewhere in the extended volume by global strangeness 
conservation. In the framework of the SHM this situation is represented by the grand 
canonical ensemble (Eqs. (7.34), (7.36)); the global chemical potential ug expresses 
quantum number conservation on average. Strict, local conservation is represented 
by the canonical ensemble. 

The grand canonical (GC) situation can be shown to be the large collision volume 
limit (with high multiplicities (N;]) of the canonical (C) formulation [118, 119], 
with a continuous transition concerning the degree of canonical strangeness suppres- 
sion [119]. To see this one starts from a system that is already in the GC limit with 
respect to baryon number and charge conservation whereas strangeness is treated 
canonically. Restricting to s — 1 and —1 the GC strange particle densities can be 
written (from Eq. (7.36)) as 


M 
A pesi ed (7.41) 


with 


Vgs 
Zs=+1 = 222 


m Ka 72) exp f (Bsus + Osuo)/T} (7.42) 


and a “fugacity factor” Aj 


can be written as [119] 


= exp (us/T). The canonical strange particle density 


nE =n. (a) (7.43) 


S 
with an effective fugacity factor 


2s - Si h(x) 
T WS18-1 b) 


where Si; = eet Zs—+1 is the sum over all created hadrons and resonances 
with s = +1, the /,(x) are modified Bessel functions, and x = 24/$1$., is 
proportional to the total fireball volume V. In the limit x ~ V — oo the 
suppression factor /1(x)/Io(x) — 1, and the ratio Si1/./S,S_1 corresponds 
exactly to the fugacity A, in the GC formulation (see Eq. (7.41)). Thus the C and GC 
formulations are equivalent in this limit, and the canonical strangeness suppression 
effect disappears. Upon generalization to the complete strange hadron spectrum, 
with s = +1, +2, +3, the strangeness suppression factor results [119] as 


(7.44) 


n(s) = Is(x)/ Io(x). (7.45) 
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In particular for small x (volume), n(s) — (x/2), and one expects that the larger 
the strangeness content of the particle the smaller the suppression factor, and hence 
the larger the enhancement in going from elementary to central A+A collisions. This 
explains the hierarchy addressed in Eq. (7.40), and apparent from the data shown in 
Fig. 7.29. In fact, the curves shown in this figure represent the results obtained from 
Eq. (7.45), for s = 1,2,3 hyperon production at ./s = 17.3 GeV [119]. They 
are seen to be in qualitative agreement with the data. However the scarcity of data, 
existing at top SPS energy for total hyperon yields, obtained in 47x acceptance (recall 
the arguments in Sect. 7.3.1) both for A+A and p+p collisions does not yet permit 
to cover the SHM strangeness saturation curves in detail, for s > 1. 

This saturation is seen in Fig. 7.29, to set in already at modest system sizes, but 
sequentially so, for ascending hyperon strangeness. Note that SHM saturation is 
sequentially approached, from Eq. (7.45), with increasing fireball volume V. In 
order to make contact to the experimental size scaling with centrality, e.g. Npart, 
the model of ref. [119], which is illustrated in Fig. 7.29, has converted the genuine 
volume scale to the Npar scale by assuming a universal eigenvolume of 7 fm? per 
participant nucleon. Le. Nparr = 10 really means a coherent fireball volume of 
70 fm?, in Fig. 7.29. Within this definition, saturation of s — 1, 2, 3 sets in at fireball 
volumes at hadronization of about 60, 240 and 600 fm?, respectively: this is the real 
message of the SHM curves in Fig. 7.29. 

The above direct translation of coherent fireball volume to participant number is 
problematic [120] as it assumes that all participating nucleons enter into a single 
primordially coherent fireball. This is, however, not the case [120] particularly in 
the relative small scattering systems that cover the initial, steep increase of n(s), 
where several local high density clusters are formed, each containing a fraction of 
Npart- This is revealed by a percolation model [120] of cluster overlap attached to a 
Glauber calculation of the collision/energy density. At each Npart an average cluster 
volume distribution results which can be transformed by Eq. (7.45) to an average 
{n(s, V)} distribution whose weighted mean is the appropriate effective canonical 
suppression factor corresponding to Npart. On the latter scale, the SHM suppression 
curve thus shifts to higher Npart, as is shown in Fig. 7.30 for the Kt /z7 ratio vs. 
Npart, measured at mid-rapidity by PHENIX in Au+Au collisions at ./s = 200 GeV, 
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which is reproduced by the percolation model [120]. Also included is a prediction 
for Cu+Cu at this energy which rises more steeply on the common Npar scale 
because the collision and energy density reached in central Cu+Cu collisions, at 
Npart © 100, exceeds that in peripheral Au+Au collisions (at the same Npart) which 
share a more prominent contribution from the dilute surface regions of the nuclear 
density profile. We note, finally, that this deviation from universal Npar scaling does 
not contradict the observations of a perfect such scaling as far as overall charged 
particle multiplicity densities are concerned (recall Fig. 7.12) which are dominated 
by pions, not subject to size dependent canonical suppression. 


7.3.3 Origin of Hadro-Chemical Equilibrium 


The statistical hadronization model (SHM) is not a model of the QCD confinement 
process leading to hadrons, which occurs once the dynamical cooling evolution 
of the system arrives at T,. At this stage the partonic reaction volume, small in 
elementary collisions but extended in A+A collisions, will decay (by whatever 
elementary QCD process) to on-shell hadrons and resonances. This coherent 
quantum mechanical decay results in a de-coherent quasi-classical, primordial on- 
shell hadron-resonance population which, at the instant of its formation, lends 
itself to a quasi-classical Gibbs ensemble description. Its detailed modalities 
(canonical for small decaying systems, grand canonical for extended fireballs in 
A+A collisions), and its derived parameters [7', ug] merely recast the conditions, 
prevailing at hadronization. The success of SHM analysis thus implies that the 
QCD hadronization process ends in statistical equilibrium concerning the hadron- 
resonance species population. 

In order to identify mechanisms in QCD hadronization that introduce the hadro- 
chemical equilibrium we refer to jet hadronization in e*e~ annihilation reactions, 
which we showed in Fig. 7.17 to be well described by the canonical SHM. In di-jet 
formation at LEP energy, ./s = 92 GeV, we find a charged particle multiplicity 
of about 10 per jet, and we estimate that, likewise, about 10 primordial partons 
participate on either side of the back-to-back di-jet [85]. There is thus no chance for 
either a partonic or hadronic, extensive rescattering toward chemical equilibrium. 
However, in the jet hadronization models developed by Amati and Veneziano [83], 
Webber [121] and Ellis and Geiger [85] the period of QCD DGLAP parton shower 
evolution (and of perturbative QCD, in general) ends with local color neutralization, 
by formation of spatial partonic singlet clusters. This QCD “color pre-confinement” 
[83] process reminds of a coalescence mechanism, in which the momenta and the 
initial virtual masses of the individual clustering partons get converted to internal, 
invariant virtual mass of color neutral, spatially extended objects. Their mass 
spectrum [121] extends from about 0.5 to 10 GeV. This cluster mass distribution, 
shown in Fig.7.31, represents the first stochastic element in this hadronization 
model. 
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Fig. 7.31 Invariant mass 
spectrum of color 
neutralization clusters in the 
Veneziano-Webber 
hadronization model 

[83, 121] 
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The clusters are then re-interpreted within non-perturbative QCD: their internal, 
initially perturbative QCD vacuum energy gets replaced by non-perturbative quark 
and gluon condensates, making the clusters appear like hadronic resonances. Their 
subsequent quantum mechanical decay to on-shell hadrons is governed by the phase 
space weights given by the hadron and resonance spectrum [85, 121]. Le. the clusters 
decay under “phase space dominance" [85], the outcome being a micro-canonical or 
a canonical hadron and resonance ensemble [84, 107]. The apparent hadro-chemical 
equilibrium thus is the consequence of QCD color neutralization to clusters, and 
their quantum mechanical decay under local quantum number conservation and 
phase space weights. We note that the alternative description of hadronization, by 
string decay [122], contains a quantum mechanical tunneling mechanism, leading 
to a similar phase space dominance [123]. 

Hadronization in e*e^ annihilation thus occurs from local clusters (or strings), 
isolated in vacuum, of different mass but similar energy density corresponding 
to QCD confinement. These clusters are boosted with respect to each other but 
it was shown [124] that for a Lorentz invariant scalar, such as multiplicity, the 
contributions of each cluster (at similar T) can be represented by a single canonical 
system with volume equal to the sum of clusters. In the fit of Fig. 7.17 this volume 
sum amounts to about 45 fm? [84]; the individual cluster volumes are thus quite 
small, of magnitude a few fm? [85]. This implies maximum canonical strangeness 
suppression but may, in fact, require a micro-canonical treatment of strangeness 
[109], implying a further suppression. These MC effects are oftentimes included 
[125] in the canonical partition functions by an extra strangeness fugacity parameter 
ys < 1 which suppresses s = 1, 2, 3 in a hierarchical manner, (N; (s)) ^ (y,). The 
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fit of Fig. 7.17 requires y, — 0.66, a value typical of canonical multiplicity analysis 
in ptp, p*p and ete” annihilation collisions [109] at ./s > 30 GeV. 

The above picture, of hadrochemical equilibrium resulting from the combined 
stochastic features of QCD color neutralization by cluster formation, and subsequent 
quantum mechanical decay to the on-shell hadron and resonance spectrum (under 
phase space governance) lends itself to a straight forward extension to A+A 
collisions. The essential new features, of grand canonical hadronization including 
strangeness enhancement, should result from the fact that extended space-time 
volumes of € > écrit are formed in the course of primordial partonic shower 
evolution, an overlap effect increasing both with ./s and with the size of the 
primordial interaction volume. As the volume of the elementary hadronization 
clusters amounts to several fm? it is inevitable that the clusters coalesce, to form 
extended "super-cluster" volumes prior to hadronization [120]. As these super- 
clusters develop toward hadronization via non perturbative QCD dynamics, it is 
plausible to assume an overall quantum mechanical coherence to arise over the 
entire extended volume, which will thus decay to hadrons under global quantum 
number conservation, the decay products thus modeled by the GC ensemble. 

Our expectation that space-time coalescence of individual hadronization clusters 
will lead to a global, quantum mechanically coherent extended super-cluster 
volume, that decays under phase space dominance, appears as an analogy to the 
dynamics and quantum mechanics governing low energy nuclear fission from 
a preceding “compound nucleus” [126]. Note that the observation of a smooth 
transition from canonical strangeness suppression to grand canonical saturation 
(Figs. 7.29, 7.30) lends further support to the above picture of a percolative growth 
[120] of the volume that is about to undergo hadronization. 

An extended, coherent quark gluon plasma state would, of course, represent 
an ideal example of such a volume [127] and, in fact, we could imagine that the 
spatial extension of the plasma state results from a percolative overlap of primordial 
zones of high energy density, which becomes more prominent with increasing ./s 
and Npart- A QGP state preceding hadronization will thus lead to all the observed 
features. However, to be precise: the hadronizing QCD system of extended matter 
decaying quantum coherently, could still be a non-equilibrium precursor of the ideal 
equilibrium QGP, because we have seen above that hadrochemical equilibrium also 
occurs in e*e^ annihilation, where no partonic equilibrium exists. It gets established 
in the course of hadronization, irrespective of the degree of equilibrium prevailing 
in the preceding partonic phase. 


7.3.4 Hadronization vs. Rapidity and ./s 


We have argued in Sect. 7.3.1 that, at relatively low 4/s, the total rapidity gap 
Ay does not significantly exceed the natural thermal rapidity spreading width 
T; & 2.35 (T/m;)! of a single, isotropically decaying fireball, centered at 
mid-rapidity and emitting hadrons of mass m; [110]. However, this procedure 
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involves an idealization because in the real Pb+Pb collision the intersecting dilute 
surface sections of the nuclear density profiles will lead to a significant contribution 
of single-scattering NN collisions, outside the central high density fireball. The 
leading hadron properties of such “corona collisions" result in wider proper rapidity 
distributions, quite different from those of the central fireball decay hadrons. 
Their contribution will thus be prominent near target/projectile rapidity, and will 
feature a canonically suppressed strangeness. The one-fireball assumption, although 
inevitable at small Ay, does not quite resemble the physical reality. This may 
explain the need for an additional strangeness suppression factor in the GC one- 
particle partition function (Eq. (7.32)) that has, unfortunately, also been labeled y; 
but expresses physics reasons quite different from the extra suppression factor that 
reflects micro-canonical phase space constraints in elementary collisions. It turns 
out that all GC analysis of central A+A collisions at low 4/s, and addressed to total 
4x multiplicities, requires a ys of 0.7—0.85 [19]; in the fit of Fig. 7.25 y, = 0.84. 
At RHIC, Ay © 11 > l';, and such difficulties disappear: y, ~ 1 at midrapidity 
and, moreover, the wide gap permits a SHM analysis which is differential in y. 
Figure 7.32 shows the y-dependence of the ratios m~/m+, K /K* and p/p as 
obtained by BRAHMS [128] in central Au+Au collisions at ./s = 200 GeV. 
The figure shows a dramatic dependence of the p/p ratio, which reflects the local 
baryochemical potential according to Eq. (7.38). At ycm > 1 the p/p ratio drops 
down steeply, to about 0.2 at y ~ 3.5, thus making close contact to the top SPS 
energy value obtained by NA49 [129]. The K^/K* ratio follows a similar but 
weaker drop-off pattern, to about 0.65 again matching with the top SPS energy value 
of about 0.6 [130]. The deviation from unity of these ratios reflects the rapidity 
densities of initial valence u, d quarks, relative to the densities of newly created 
light and strange quark-antiquark pairs, i.e. the y distribution of the net baryon 
number density, and of the related baryo-chemical potential of the GC ensemble. 
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Fig. 7.32 (Left) Anti-hadron to hadron ratios as a function of rapidity in central Au+Au collisions 
at ./s = 200 GeV. (Right) Interpretation of the correlation between p/p and K~/K* in terms of 
baryo-chemical potential ug variation in the grand canonical statistical model. From [128] 
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Thus, in analyzing successive bins of the rapidity distributions in Fig. 7.32, the major 
variation in the GC fit concerns the baryo-chemical potential ug (y) which increases 
from about 20 MeV (Fig. 7.26) at mid-rapidity, to about 150 MeV at y > 3 while the 
hadronization temperature stays constant, at T — 160 MeV. This interplay between 
K-/K*, p/p and pg is illustrated [128] in the right hand panel of Fig. 7.32, and 
shown to be well accounted for by the GC statistical model [131]. 

These considerations imply that hadronization at RHIC (and LHC) energy occurs 
local in y-space and late in time. The density distribution of net baryon number 
results from the primordial pQCD shower evolution (c.f. Sect. 7.2.4), and is thus 
fixed at formation time, fo < 0.6 fm/c at RHIC. Hadronization of the bulk partonic 
matter occurs later, at £ > 3 fm/c [86, 95], and transmits the local conditions in 
rapidity space by preserving the local net baryon quantum number density. Most 
importantly we conclude that hadronization occurs, not from a single longitudinally 
boosted fireball but from a succession of "super-clusters", of different partonic 
composition depending on y, and decaying at different time due to the Lorentz- 
boost that increases with y, in an “inside-outside” pattern (c.f. Fig. 7.18). We are 
thus witnessing at hadronization a Hubble expanding system of local fireballs. 
The detailed implications of this picture have not been analyzed yet. Note that a 
central RHIC collision thus does not correspond to a single hadronization “point” 
in the [T, jz] plane of Fig. 7.1 but samples (T, u} along the QCD parton-hadron 
coexistence line [132]. 

Throughout this chapter we have discussed hadronic freeze-out at high ./s only 
(top SPS to RHIC energy), because of the proximity of the chemical freeze-out 
parameters [7, ug] to the QCD phase boundary from lattice QCD, which suggests 
an overall picture of hadronization, to occur directly from a partonic cluster or 
super-cluster. Our discussion of the GC statistical hadronization model has been 
explicitly or implicitly based on the assumption that hadronic freeze-out coincides 
with hadronization. However, the GC model has also been applied successfully 
to hadro-chemical freeze-out at ./s down to a few GeV [19, 107, 108] where it 
is not expected that the dynamical evolution traverses the phase boundary at all, 
but grand canonical multiplicity distributions, and their characteristic strangeness 
enhancement pattern, are observed throughout. Toward lower ./s, T decreases 
while ug increases, as is shown in Fig. 7.33 which presents a compilation of all 
reported freeze-out parameters [108]. 

These points have also been included in the phase diagram of Fig. 7.1 which 
shows that they are gradually branching away from the phase separation boundary 
line that could recently be predicted by lattice QCD once new methods had been 
developed to extend the theory to finite ug [9, 10]. At ./s > 20 GeV we see that 


€c(QCD) © eg © €Gc (7.46) 


where egc is the freeze-out density inferred from GC analysis [19, 107, 108]. 

In turn, the GC hadronic freeze-out points drop below the lattice QCD coexis- 
tence line at lower ./s, implying that chemical freeze-out now occurs within the 
hadronic expansion phase. This requires a model of freeze-out, now governed by 
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Fig. 7.33 Energy i T j T j T : i 
dependence of the 
hadro-chemical freeze-out 
points obtained by grand 200 F RHIC 
canonical statistical model L 
analysis in the plane [T, uB], 
with interpolating curve at 
fixed energy per particle of 
about 1 GeV [107, 139] 


T, [MeV] 


100 


— np-4 ng = const 
—— (EY(N)- 1 GeV SIS 


the properties of a high density hadronic medium, upon expansive cooling and 
dilution. Holding on to the model of a quantum mechanical de-coherence decay to 
on-shell hadrons that we discussed in Sect. 7.3.3, we argue that an initial, extended 
high density hadronic fireball, given sufficient life-time at T smaller, but not far 
below 7;, could also be seen as a quantum mechanically coherent super-cluster, as 
governed by effective mean fields [133]. In such a medium hadrons, at T near To, 
acquire effective masses and/or decay widths far off their corresponding properties 
in vacuum: they are off-shell, approaching conditions of QCD chiral symmetry 
restoration as T — T, [134]. This symmetry is inherent in the elementary QCD 
Lagrangian, and “softly” broken within the light quark sector by the small non-zero 
current quark masses, but severely broken at T — 0 by the high effective constituent 
quark masses that get dressed by non perturbative QCD vacuum condensates. 
Pictorially speaking, hadrons gradually loose this dressing as T — Te [135], 
introducing a change, away from in vacuum properties, in the hadronic mass 
and width spectrum. Such in-medium chiral restoration effects have, in fact, been 
observed in relativistic A+A collisions, by means of reconstructing the in-medium 
decay of the p vector meson to an observed et e~ pair [136] (see Sect. 7.6.3). 

A dense, high T hadronic system, with mean-field induced off-shell constituents 
is also, clearly, quantum mechanically coherent. At a certain characteristic density, 
€ < €c, and temperature T < T., as reached in the course of overall hadronic 
expansion, this extended medium will undergo a decoherence transition to classical 
on-shell hadrons. Its frozen-out hadronic multiplicity distribution should be, again, 
characterized by the phase space weights of a grand canonical ensemble at T < To. 
Theoretical studies of such a mean field hadronic expansion mode [137] have also 
shown that such mechanisms play essentially no role at ./s > 20 GeV because 
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the expanding system is already in rapid flow once it traverses the phase boundary, 
with an expansion time scale shorter than the formation time scale of mean field 
phenomena. At lower energies, on the other hand, the system might not even dive 
into the deconfined phase but spend a comparatively long time in its direct vicinity, 
at the turning point between compression and re-expansion where all dynamical 
time constants are large, and the hadron density is high, such that the inelastic 
hadronic transmutation rate becomes high (particularly in collisions of more than 
two hadronic reactants, with reaction rates [138] proportional to €"), and sufficiently 
so for maintaining hadronic chemical equilibrium after it is first established at 
maximum hadron density, in low ./s systems that do not cross the phase boundary 
at all. 

The GC freeze-out parameters [T, u] at various ./s in Fig.7.33 permit a 
smooth interpolation in the T, u plane [139], which, in turn, allows for GC model 
predictions which are continuous in 4/s. Such a curve is shown in Fig.7.28 
compared to the 47 data points for the K * /z* multiplicity ratio in central collisions 
Au+Au/Pb+Pb, at all ./s investigated thus far. It exhibits a smooth maximum, due 
to the interplay of T saturation and upg fall-off to zero, but does not account for 
the sharp peak structure seen in the data at ys ~ 7 GeV and up ^ 480 MeV. 
This behavior is not a peculiarity of the K * channel only; it also is reflected in 
an unusually high Wroblewski ratio (see Eq. (7.39)) obtained at ./s = 7.6 GeV, 
of A, = 0.60 [19]. This sharp strangeness maximum is unexplained as of yet. It 
implies that hadron formation at this ./s reflects influences that are less prominent 
above and below, and most attempts to understand the effect [141—143] are centered 
at the assumption that at this particular J/s the overall bulk dynamics will settle 
directly at the phase boundary where, moreover, finite ug lattice theory also expects 
a QCD critical point [9-11]. This would cause a softest point to occur in the equation 
of state, i.e. a minimum in the relation of expansion pressure vs. energy density, 
slowing down the dynamical evolution [144, 145], and thus increasing the sensitivity 
to expansion modes characteristic of a first order phase transition [143], which 
occurs at ug > pe Such conditions may modify the K/z ratio (Fig. 7.28) [143]. 

It thus appears that the interval from top AGS to lower SPS energy, 5 < /s < 
10 GeV, promises highly interesting information regarding the QCD phase diagram 
(Fig. 7.1) in the direct vicinity of the parton-hadron coexistence line. In particular, 
the physics of a critical point of QCD matter deserves further study. Observable con- 
sequences also comprise so-called "critical fluctuations" [146, 147] of multiplicity 
density, mean transverse momentum and hadron-chemical composition [148], the 
latter in fact being observed near J/s = 7 GeV in an event by event study of the 
K /7 ratio in central Pb+Pb collisions [149]. We shall return to critical point physics 
in Sect. 7.7. 
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7.4 Elliptic Flow 


We have up to now mostly stressed the importance of central collisions and mid- 
rapidity data because they provide for the highest primordial energy density and 
avoid problems caused by emission anisotropy and the presence of cold spectator 
sections of target and projectile nuclei. On the other hand, a fundamentally new 
window of observation is opened by non-central collisions as the finite impact 
parameter breaks cylinder symmetry, defining emission anisotropies with respect 
to the orientation of the impact vector b as we have shown in Eq. (7.20). In a 
strongly interacting fireball collision dynamics, the initial geometric anisotropy 
of the reaction volume gets transferred to the final momentum spectra and thus 
becomes experimentally accessible. Furthermore, the high charged particle multi- 
plicity allows for an event-by-event determination of the reaction plane (direction 
of b), enabling the study of observables at azimuth ø, relative to the known reaction 
plane. We shall show that this opens a window into the very early stages of A+A 
collisions onward from the end of nuclear interpenetration, at t ~ 2 R(A)/ycM. 
Our observation thus begins at the extreme energy densities prevailing right at 
formation time (Sects. 7.2.4 and 7.2.5), i.e. concurrent with the initialization phase 
of relativistic hydrodynamic expansion. We access the phase diagram of Fig. 7.1 in 
regions far above the QCD phase boundary. 

Before turning to the details of elliptic flow data we wish to illustrate [96] 
the above statements. Figure 7.34 exhibits the transverse projection of primordial 
energy density, assumed to be proportional to the number density of participant 
nucleons in the overlap volume arising from a Au+Au collision at impact parameter 
b — 7 fm. The nuclear density profiles (assumed to be of Woods-Saxon type) 
intersect in an ellipsoidal fireball, with minor axis along the direction of b which 
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is positioned at y = 0. The obvious geometrical deformation can be quantified by 
the spatial excentricity (unfortunately also labeled e in the literature) 


x 
€x(b) = —— (7.47) 


where the averages are taken with respect to the transverse density profiles of 
Fig. 7.34. €x is zero for b = 0, reaching a value of about 0.3 in the case b = 7 fm 
illustrated in Fig. 7.34. 

Translated into the initialization of the hydrodynamic expansion the density 
anisotropy implies a corresponding pressure anisotropy. The pressure is higher in 
x than in y direction, and thus is the initial acceleration, leading to an increasing 
momentum anisotropy, 


f dx dy (T* — T?) 


ini Di: 7.48 
f dx dy (T** + T») (7:48) 


€p(t) = 


where TUS is the fluid's energy-momentum tensor. Figure 7.35 shows [96, 150] the 
time evolution of the spatial and momentum anisotropies for the collision considered 
in Fig. 7.34, implementing two different equations of state which are modeled with 
(without) implication of a first order phase transition in “RHIC” (“EOS1”). A steep 
initial rise is observed for €p, in both cases: momentum anisotropy builds up during 
the early partonic phase at RHIC, while the spatial deformation disappears. Le. the 
initial source geometry, which is washed out later on, imprints a flow anisotropy 
which is preserved, and observable as "elliptic flow". A first order phase transition 
essentially stalls the buildup of e, at about t = 3 fm/c when the system enters the 


Fig. 7.35 Time evolution of 
the spatial excentricity €y and 
the momentum space 
anisotropy €, (Eqs. (7.47) 
and (7.48)) in the 
hydrodynamic model of an 
Au-Au collision at b = 7 fm, 
occurring at ./s = 200 GeV 
[96]. The dynamics is 
illustrated with two equations 
of state 
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mixed phase, such that the emerging signal is almost entirely due to the partonic 
phase. We have to note here that the “ideal fluid" (zero viscosity) hydrodynamics 
[150] employed in Fig. 7.35 is, at first, a mere hypothesis, in view also of the fact 
that microscopic transport models have predicted a significant viscosity, both for a 
perturbative QCD parton gas [92, 151] and for a hadron gas [95, 152]. Proven to be 
correct by the data, the ideal fluid description of elliptic flow tells us that the QGP 
is a non-perturbative liquid [153, 154]. 

Elliptic flow is quantified by the coefficient v2 of the second harmonic term in 
the Fourier expansion (see Eq. (7.20)) of the invariant cross section; it depends on 
A/s, b, y and pr. Figure 7.36 shows the ./s dependence of v2 at mid-rapidity and 
averaged over pr, in Au+Au/Pb+Pb semi-peripheral collisions [93, 155]. We see 
that the momentum space anisotropy is relatively small overall, but exhibits a steep 
rise toward top RHIC energy. 

Figure 7.37 shows the (pseudo)-rapidity dependence of v2 at ./s = 130 and 
200 GeV as obtained by PHOBOS [156] for charged particles in minimum bias 
Au-Au collisions. It resembles the corresponding charged particle rapidity density 
distribution of Fig. 7.8, suggesting that prominent elliptic flow arises only at the 
highest attainable primordial energy density. 

That such conditions are just reached at top RHIC energy is shown in Figs. 7.38 
and 7.39. The former combines STAR [157] and PHENIX [158] data for the 
pr dependence of elliptic flow, observed for various identified hadron species 
m+, K*, p, K? and A, A in Au+Au at 200 GeV. The predicted hydrodynamic flow 
pattern [96, 159] agrees well with observations in the bulk pr < 2 GeV/c domain. 
Figure 7.39 (from [155]) unifies average v» data from AGS to top RHIC energies 
in a scaled representation [93] where v2 divided by the initial spatial anisotropy €x 
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Fig. 7.38 Transverse 
momentum dependence of 
elliptic flow v2 for mesons 
and baryons in Au+Au 
collisions at ./s = 200 GeV. 
The hydrodynamic model 
[96, 159] describes the mass 
dependence at pr < 2 GeV/c 
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is plotted versus charged particle mid-rapidity density per unit transverse area S, 
the latter giving the density weighted transverse surface area of primordial overlap, 
Fig. 7.34. Figure 7.39 includes the hydrodynamic predictions [95, 96, 150, 159, 161] 
for various primordial participant or energy densities as implied by the quantity 
(1/S) dnen/dy [93]. Scaling v2 by e, enhances the elliptic flow effect of near-central 
collisions where e, is small, and we see that only such collisions at top RHIC energy 
reach the hydrodynamical ideal flow limit in models that include an EOS ansatz 
which incorporates [96] the effect of a first order phase transition, which reduces 
the primordial flow signal as was shown in Fig. 7.35. 

At top RHIC energy, the interval between tọ ^ 0.6 fm/c, and hadronization 
time, ty ^ 3 fm/c, is long enough to establish dynamical consequences of an early 
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Fig. 7.39 Elliptic flow v2 
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approach toward local equilibrium. The “lucky coincidence" of such a primordial 
resolution of dynamical time scale, with the extreme primordial density, offered by 
semi-central collisions of heavy nuclei, results in an extremely short mean free path 
of the primordial matter constituents, thus inviting a hydrodynamic description of 
the expansive evolution. Consistent application of this model reveals a low viscosity: 
the primordial matter resembles an ideal fluid, quite different from earlier concepts, 
of a weakly interacting partonic gas plasma state (QGP) governed by perturbative 
QCD screening conditions [36, 41]. 

A further, characteristic scaling property of elliptic flow is derived from the 
pr dependence of v2, observed for the different hadronic species. In Fig. 7.38 one 
observes a hadron mass dependence, the v2 signal of pions and charged kaons rising 
faster with pr than that of baryons. Clearly, within a hydrodynamic flow velocity 
field entering hadronization, heavier hadronic species will capture a higher pr, at a 
given flow velocity. However, unlike in hadronic radial expansion flow phenomena 
(c.f. Sect. 7.2.6) it is not the hadronic mass that sets the scale for the total pr derived, 
per particle species, from the elliptic flow field, but the hadronic valence quark 
content. This conclusion is elaborated [94] in Fig. 7.40. 

The left panel shows measurements of the py dependence of v2 for several 
hadronic species, in minimum bias Au-Au collisions at /s = 200 GeV [161]. The 
middle panel bears out the hydrodynamically expected [162] particle mass scaling 
when v» is plotted vs. the relativistic transverse kinetic energy K Ey = my —m 
where mr = ( pr + m?)!/?, For K Ey > 1 GeV, clear splitting into a meson branch 
(lower v2) and a baryon branch (higher v2) occurs. However, both of these branches 
show good scaling separately. The right panel shows the result obtained after scaling 
both v? and K Er (i.e. the data in the middle panel) by the constituent quark number, 
ng = 2 for mesons and n, = 3 for baryons. The resulting perfect, universal scaling 
is an indication of the inherent quark degrees of freedom in the flowing matter as 
it approaches hadronization. We thus assert that the bulk of the elliptic flow signal 
develops in the pre-hadronization phase. 
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Fig. 7.40 v» vs. pr (left panel) and transverse kinetic energy K Er = mp—mg (middle) for several 
hadronic species in min. bias Au+Au collisions at ./s = 200 GeV, showing separate meson and 
baryon branches. Scaling (right panel) is obtained by valence quark number nq, dividing v? and 
K Er [94] 


The above scaling analysis has been extended to meson and Q hyperon 
production, and also to first PHENIX results [163] concerning elliptic flow of 
the charmed D meson [94], with perfect agreement to the observations made in 
Fig. 7.40, of a separation into meson/hadron branches on the K Er scale, which 
merge into a universal v2 scaling once both v? and K Er per valence quark are 
considered. The observation that the D meson charm quark apparently shares in the 
universal flow pattern is remarkable as its proper relaxation time is, in principle, 
lengthened by a factor M/T [164]. A high partonic rescattering cross section o 
is thus required in the primordial QGP fireball, to reduce the partonic mean free 
path A = 1/no (where n is the partonic density), such that à « A!/? (the overall 
system size) and, simultaneously, A < 1 fm in order to conform with the near-zero 
mean free path implication of the hydrodynamic description of the elliptic flow, 
which reproduces the data gathered at RHIC energy. The presence of a high partonic 
rescattering cross section was born out in a parton transport model study [92] of the 
steep linear rise of the elliptic flow signal with pr (Fig. 7.38). In such a classical 
Boltzmann equation approach the cross sections required for the system to translate 
the initial spatial into momentum space anisotropy, fast enough before the initial 
deformation gets washed out during expansion (c.f. Fig. 7.35), turn out to exceed 
by almost an order of magnitude the values expected in a perturbative QCD quark- 
gluon gas [92]. 

The non-perturbative quark-gluon plasma is thus a strongly coupled state (which 
has been labeled sQGP [165]). At RHIC energy this reduces the partonic mean free 
path to a degree that makes hydrodynamics applicable. A Navier-Stokes analysis 
[166] of RHIC flow data indicates that the viscosity of the QGP must be about ten 
times smaller than expected if the QGP were a weakly interacting pQCD Debye 
screened plasma. This justifies the use of perfect fluid dynamics. 
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Considering first attempts to derive a quantitative estimate of the dimensionless 
ratio of (shear) viscosity to entropy, we note that 7/s is a good way to characterize 
the intrinsic ability of a substance to relax toward equilibrium [167]. It can be 
estimated from the expression [94] 


n/s F2 The Cs (7.49) 


where T is the temperature, Ar the mean free path, and c, is the sound speed derived 
from the partonic matter EOS. A fit by the perfect fluid “Buda-Lund” model [168] 
to the scaled v? data shown in Fig. 7.40 yields T = 165 + 3 MeV; c, is estimated 
as 0.35 + 0.05 [94, 162], and A; ~ 0.30 fm taken from a parton cascade calculation 
including 2 < 3 scattering [169]. The overall result is [94] 


n/s = 0.09 + 0.02 (7.50) 


in agreement with former estimates of Teaney and Gavin [170]. This value is very 
small and, in fact, close to the universal lower bound of 7/s = 1/4 recently derived 
field theoretically [171]. 

Elliptic flow measurements thus confirm that the quark-gluon matter produced 
as J/s — 200 GeV is to a good approximation in local thermal equilibrium up to 
about 3-4 fm/c. In addition, the final hadron mass dependence of the flow pattern 
is consistent with a universal scaling appropriate for a nearly non-viscous hydrody- 
namic flow of partons, and the observed v» signal reflects a primordial equation of 
state that is consistent with first numerical QCD computations [153, 154, 165] of 
a strongly coupled quark-gluon plasma (sQGP) state. First estimates of its proper 
shear viscosity to entropy ratio, 7/5, are emerging from systematic analysis of the 
elliptic flow signal. At lower 4/s precursor elliptic flow phenomena are observed, 
as well, but are more difficult to analyze as the crucial, new feature offered by top 
RHIC energies is missing here: a clear cut separation in time, between primordial 
formation of local partonic equilibrium conditions, and hadronization. At RHIC 
(and at future LHC) energy elliptic flow systematics thus captures the emerging 
quark-gluon state of QCD at energy densities in the vicinity of € = 6-15 GeV/fm?, 
at temperature T ~ 300 MeV, and ug — 0, describing it as a strongly coupled, low 
viscosity liquid medium. 


7.5 In-medium Attenuation of High pr Hadronand Jet 
Production 


In the preceding sections we have followed the dynamical evolution of bulk matter 
in A+A collisions (at pr < 2 GeV which covers about 95% of the hadronic output), 
from initial formation time of partonic matter which reflects in charged particle 
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transverse energy and multiplicity density, also giving birth to hadrons, and to the 
elliptic expansion flow signal. 

An alternative approach toward QCD plasma diagnostics exploits the idea 
[41, 172] of implanting partonic products of primordial high Q? processes into the 
evolving bulk medium, that could serve as “tracers” of the surrounding, co-traveling 
matter. The ideal situation, of being able to scatter well defined partons, or electrons, 
from a plasma fireball, is approximated by employing primordially formed charm- 
anticharm quark pairs [41], or leading partons from primordial di-jet production 
[172]. Both processes are firmly anchored in perturbative QCD and well studied in 
elementary collisions (where such partons are directly released into vacuum), which 
thus serve as a reference in an analysis that quantifies the in-medium modification 
of such tracer partons. Not a surprise, in view of our above inferences, from elliptic 
flow, of a high temperature, strongly coupled primordial medium: these in-medium 
modifications are quite dramatic, leading to a suppression of J/ V production from 
primordial cc pairs (Sect. 7.6), and to high pr hadron and jet quenching, the subject 
of this chapter. 


7.5.1 High py Inclusive Hadron Production Quenching 


At top RHIC energy, ./s = 200 GeV, di-jet production from primordial hard pQCD 
parton-parton collisions (of partons from the initial baryonic structure functions) is 
the source of "leading" partons, with Ey up to about 30 GeV. They are derived from 
the inclusive cross section arising if the A+A collision is considered, first, as an 
incoherent superposition of independent nucleon-nucleon collisions, as enveloped 
within the target-projectile nucleon densities. In this framework, the pQCD cross 
section for producing an ET parton in A+B takes the form of “factorization” [173] 


do doap 
= d d. ——— 7.51 
dE dy 2 i5 Xa f. Xp. fajA (Xa) foj B (Xb) TAY (7.51) 


where the f(x) are the parton distributions inside projectile A and target B nuclei, 
and the last term is the pQCD hard scattering cross section. This equation describes 
the primordial production rate of hard partons, leading to the conclusion [172] that, 
at RHIC energy, all hadrons at pr > 6-10 GeV should arise from initial pQCD 
parton production. 

As partons are effectively frozen during the hard scattering, one can treat each 
nucleus as a collection of free partons. Thus, with regard to high pr production, the 
density of partons within the parton distribution function of an atomic number A 
nucleus should be equivalent to the superposition of A independent nucleons N: 


faja x, Q’) = A fos (x, Q’). (7.52) 
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From Eqs. (7.51) and (7.52) it is clear that the primordial high Q? inclusive 
parton cross section in a minimum bias A+B reaction scales as A - B times the 
corresponding (N+N or) p+p cross section. Furthermore, as each leading parton 
ends up in an observed high pr hadron h, we thus write the invariant hard hadron 
cross section as 


E doapg n/d? p = A- B- E dopp>n/ p. (7.53) 


Since nucleus-nucleus experiments usually measure invariant yields Nj for a given 
centrality bin, corresponding to an average impact parameter b, one writes instead: 


E dNap n (b)/d? p = (Tap(b)) E doy. /d^ p, (7.54) 


where TAp(5) is the Glauber geometrical overlap function of nuclei A, B at impact 
parameter b, which accounts for the average number of participant parton collisions 
at given impact geometry [174], (Ncou(b)). One can thus quantify the attenuating 
medium effects, as experienced by the primordially produced tracer parton on its 
way toward hadronization, by the so-called nuclear modification factor for hard 
collisions (analogous to Eq. (7.6), that refers to soft, bulk hadron production): 


d? NAg/dy dpr 


— 7.55 
(Tan (P) Popp /dy dpr eo) 


Rap(pt, y, b) = 


Obviously, this concept of assessing the in-medium modification of hadron produc- 
tion at high pr requires corresponding p+p collision data, as a reference basis. Such 
data have been, in fact, gathered at top RHIC, and top SPS energies, ./s = 200 and 
17.3 GeV, respectively. Alternatively, in situations where the relevant reference data 
are not known, one considers the production ratio of hadronic species h, observed 
in central relative to peripheral collisions: 


d^Ns(bi)/dy dpr — (Tag(b2)) 


I h P1 0 CPT 7.56 
d? Ny (b2)/dy dpr ii (TAB (b1)) PB) 


Rep(pt, y) = 


where bı < b» are the average impact parameters corresponding to the employed 
trigger criteria for "central" and "peripheral" A+A collisions, respectively. This ratio 
recasts, to a certain extent, the in-medium attenuation analysis, offered by Rap, 
insofar as peripheral collisions approach the limiting conditions, of a few single 
nucleon-nucleon collisions occurring in the dilute surface sections of the nuclear 
density profiles, i.e. essentially in vacuum. 

Employing the above analysis schemes, the RHIC experiments have, in fact, 
demonstrated a dramatic in-medium suppression of the high pr yield, for virtually 
all hadronic species. Figure 7.41 shows Raa for neutral pions produced in min. 
bias Cu+Cu and Au-Au collisions at ./s = 200 GeV where PHENIX extended the 
pt range up to 18 GeV/c [175]; the nuclear modification factor refers to the range 
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Fig. 7.41 The nuclear 
modification factor RAA for 
z” in min. bias Cu+Cu and 
Au-Au collisions at 

A/s = 200 GeV, in the range 
pr > 7 GeV/c [175], plotted 
vs. centrality as measured by 
participant nucleon number 
N, part 


Fig. 7.42 Raa for x? 
production in central Au+Au 
collisions at ./s = 200 GeV 
[175], compared to a 
hydrodynamic calculation for 
different opacities (transport 
coefficients [195]) of the 
plasma [204] 
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pr > 7.0 GeV/c and is shown as a function of centrality, reflected by Npart. We infer 
a drastic suppression, by an Raa approx0.25 in near central collisions. Figure 7.42 
shows the pr dependence [175] of neutral pion Raa in central Au+Au collisions 
(Npart = 350), with a suppression to below 0.2 at pr > 4 GeV/c. 

Note that Raa cannot reach arbitrarily small values because of the unatten- 
uated contribution of quasi-in-vacuum surface "corona" single nucleon-nucleon 
collisions, closely resembling the p + p — 2° + X inclusive yield employed in 
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the denominator of RAA, Eq. (7.55). Even in a central trigger configuration this 
suggests a lower bound, Raa ^ 0.15. Nuclear attenuation of high pr pions thus 
appears to be almost “maximal”, implying a situation in which the interior sections 
of the high energy density fireball feature a very high opacity, i.e. they are almost 
“black” to the high pr partons contained in pions. 

We remark here, briefly, on a confusing feature that occurs in Fig. 7.42 (and in 
the majority of other Raa vs. pr plots emerging from the RHIC experiments): 
the pQCD number of collision scaling employed in RAA does not describe pion 
production at low pr as we have demonstrated in Sect. 7.2. The entries at pr < 
3 GeV/c are thus besides the point, and so are pQCD guided model predictions, as 
shown here [176]. 

Nuclear modification analysis at RHIC covers, by now, the high pr production of 
a multitude of mesonic and baryonic species [177], most remarkably even including 
the charmed D meson which is measured via electrons from semi-leptonic heavy 
flavor decay by PHENIX [175] and STAR [178]. Figure 7.43 illustrates the first 
PHENIX results from central Au+Au at ./s = 200 GeV, Raa falling to about 
0.3 at 5 GeV/c. Heavy flavor attenuation thus resembles that of light quarks, as is 
also attested by the predictions of in-medium parton transport models [179, 180] 
included in Fig.7.43, which cast the medium opacity into an effective parton 
transport coefficient ĝ which is seen here to approach a value 14 GeV?/c at high pr, 
again corresponding to a highly opaque medium. We shall describe this approach in 
more detail below. 

A most fundamental cross-check of the in-medium attenuation picture of color 
charged partons consists in measuring Raa for primordial, "direct" photons. 


Fig. 7.43 The nuclear Au+Au Vs 200 Gev, 0 -10% Centrality 
modification factor Raa vs. 


pr for electrons from 1.4 PHENIX preliminary 
semi-leptonic decays of 
heavy flavor (mostly D) 
mesons in central Au+Au 1.2 (1) q = 0 GeV ?fc 
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Figure 7.44 shows first PHENIX results [181] for direct photon Raa vs. pr in 
central Au+Au at ./s = 200 GeV. Photons obey pQCD number of collisions scaling, 
Raa œ% 1! Also included in Fig. 7.44 are the attenuation ratios for neutral pions 
(already shown in Fig. 7.42), and for 7 mesons that follow the pattern of extreme 
suppression. In essence, the PHENIX results in Figs.7.43 and 7.44 wrap up all 
one needs to know for a theoretical analysis of fireball medium opacity, for various 
flavors, and indicate transparency for photons. 

We turn, briefly, to Rcp as opposed to RAA analysis, in order to ascertain similar 
resulting conclusions at RHIC energy. Figure 7.45 illustrates an Rcp analysis of 
7, p and charged hadron high pr production in Au+Au at ./s = 200 GeV by STAR 
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Fig. 7.44 Modification factor RAA vs. pr for direct photons in central Au+Au at 200 GeV 
(squares). Also shown are RAA for x? (triangles) and 7 (circles) [181] fitted by the attenuation 
model [176, 180] 
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Fig. 7.45 Rep, the ratio of scaled 2, p and charged hadron yields vs. pr in central (596) and 
peripheral (60-80%) Au+Au collisions at /s = 200 GeV [182] 
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[182], the central to peripheral yield ratio referring to a 5%, and a 60-80% cut of 
minimum bias data. We showed in Eq. (7.56) that the Rcp measure also refers to a 
picture of pQCD number of binary collision scaling, inappropriate at low pr. Thus 
ignoring the features of Fig. 7.45 at pr < 3 GeV/c we conclude that the high pr 
data again suggest a suppression by about 0.3, common to pions and protons, thus 
approaching the ratio, of about 0.2, observed in Figs. 7.41 and 7.42 which employ 
the “ideal” in-vacuum p + p — hadron + X reference. 

At top SPS energy, ./s = 17.3 GeV, the experimentally covered pr range is 
fairly limited [183], pr « 4 GeV/c. Figure 7.46 shows NA49 results, Rcp for 
p and charged pions. Contrary to former expectations that such data would be 
overwhelmed by Croonin-enhancement [184] the same systematic behavior as at 
RHIC is observed, qualitatively: Rcp (baryon)> Rcp (meson) at pr > 3 GeV/c. 
Note that, again, the data do not approach unity at pr — 0 because of the employed 
binary scaling, and that the strong rise of the proton signal at pr « 2 GeV/c is 
largely the result of strong radial flow in central Pb+Pb collisions, an effect much 
less prominent in pion pr spectra. The high pr suppression is much weaker than 
at RHIC but it is strong enough that the expected Croonin enhancement of high 
pr mesons is not observed. These SPS data, as well as first results obtained at 
the intermediate RHIC energy of \/s = 62.4 GeV [185] are reproduced by an 
attenuation model based on the primordial gluon density d Ng/d y that scales as 
the charged particle midrapidity density d Nen/d y [176], and was also employed in 
Figs. 7.43 and 7.44. 
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7.5.0 Energy Loss in a QCD Medium 


The attenuation model that we have hinted at consists of a gluon radiative energy 
loss theory of the primordially produced leading, high pr parton as it traverses 
a medium of color charges, by means of emission of gluon bremsstrahlung. 
We expect that the resulting partonic specific energy loss, per unit pathlength 
(i.e. its dE/dx) should reflect characteristic properties of the traversed medium, 
most prominently the spatial density of color charges [176] but also the average 
momentum transfer, per in-medium collision of the considered parton or, more 
general, per unit pathlength at constant density. Most importantly, an aspect of 
non-abelian QCD leads to a characteristic difference from the corresponding QED 
situation: the radiated gluon is itself color-charged, and its emission probability is 
influenced, again, by its subsequent interaction in the medium [186] which, in turn, 
is proportional to medium color charge density and traversed pathlength L. Thus, 
the traversed path-length L in-medium occurs, both in the probability to emit a 
bremsstrahlung gluon, and in its subsequent rescattering trajectory, also of length 
L, until the gluon finally decoheres. Quantum mechanical coherence thus leads to 
the conclusion that non-abelian dE/dx is not proportional to pathlength L (as in 
QED) but to L? [187]. 

This phenomenon occurs at intermediate values of the radiated gluon energy, 
€, in between the limits known as the Bethe-Heitler, and the factorization regimes 
[186], 


Opn & Aq? K w K ona © L? qZ/A < E (7.57) 
where à is the in-medium mean free path, qi the (average) parton transverse 
momentum square, created per collision, and E the total cm energy of the traveling 
charge. In the BDMPSZ model [186—188] the properties of the medium are encoded 


in the transport coefficient, defined as the average induced transverse momentum 
squared per unit mean free path, 


i-o. 05b 


The scale of the radiated gluon energy distribution œ dN/dw is set by the 
characteristic gluon energy [186, 187] 


aL. (7.59) 


NI = 


Wc = 


To see more explicitly how the various properties of the color charged medium enter 
in ĝ we rewrite it as 


i do z 
G=p J qi dq — = pe at) =i ' ai) (7.60) 
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where p is the color charge density of scattering centers, o the effective binary 
cross section for interaction of the considered leading parton at scale q? (which 
may depend on quark flavor), and (az) as above. Obviously, both o and TA refer to 
detailed, intrinsic properties of the QCD medium, globally described by density p. 
The leading parton cross section with in-medium color charges should depend on 
the implied resolution scale, Q? = (a2). and can thus be obtained from perturbative 
QCD [186-188] only if Q? > Q2, the saturation scale that we discussed in 
Sect. 7.2. Likewise, (a2) itself reflects a medium property, the effective range of the 
color force, which is different in confined and deconfined media. Hadron size limits 
the force range in the former case, such that g is minimal in ground state hadronic 
matter also, of course, due to the small energy density o = oo = 0.15 GeV/fm? 
[189]. This was, in fact, confirmed by a RHIC run with deuteron-gold collisions, in 
which mid-rapidity hadrons traverse sections of cold Au nucleus spectator matter. 
Figure 7.47 shows results obtained for R;4 dependence on pr, for 2° from PHENIX 
[190], and for charged hadrons from STAR [191]. For comparison, both panels 
also include the corresponding Raa data for central Au+Au collisions (all at 
A/s = 200 GeV/c), exhibiting the typical, drastic high pr quenching of inclusive 
hadron production, clearly absent in d+Au collisions. 

We have shown a first application of the BDMPSZ model, to RHIC inclusive D 
meson production [179], in Fig. 7.43. Before engaging in further model application 
we note, first, that Eqs. (7.57)-(7.60) above refer to the idealized conditions of an 
infinitely extended medium of uniform composition. In reality, the fireball medium 
expands, at time scale concurrent with the proper time incurred in the leading 
partons propagation over distance L, such that all ingredients in ĝ, exhibited in 
Eq. (7.60), vary with expansion time [192]. However, before turning to adaption to 
reality of the infinite matter model, we wish to expose its prediction for the final 
connection of the specific average partonic energy loss (A E) and in-medium path 


e d+Au FIPC-Au 0-20% 
a d+Au Minimum Bias 


18r PHENIX 20 r- 


0 | | | | | | | | | 0 | l | | | 
0 1 2 3 4 5 6 7 8 9 10 0 2 4 6 8 10 


pr [GeV/c] P; [GeV/c] 


Fig. 7.47 Raa vs. pr for d+Au collisions at ./s = 200 GeV, compared to central Au+Au results, 
for zx? (left) and charged hadrons (right). From [190, 191] 
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length L traversed: 
dN ^12 
(AE) — GT dw x o, Cg q L^. (7.61) 
w 


This relation [193] represents the eikonal approximation limit of extended medium 
and high leading parton initial energy E > we (Eq. (7.59)). The average energy loss 
is thus proportional to the appropriate strong coupling constant œs, to the Casimir 
factor corresponding to the leading parton (with value 4/3 for quarks, and 3 for 
gluons), as well as to ĝ and L?. 

In order to verify the non-abelian nature of radiative parton energy loss in a 
partonic QCD medium it would, of course, be most convincing if a direct, explicit 
L? dependence of (AE) could be demonstrated. Such a demonstration is lacking 
thus far, chiefly because of the obvious difficulty of simultaneously knowing the 
primordial parton energy E, the transport coefficient Q and—in finite nuclear col- 
lision geometry—its variation along the actual traversed path L as the surrounding 
medium expands with propagation time. Moreover, the partonic medium induced 
energy loss AE of the primordial parton is not directly observable. Even if we 
assume that high pr partons evolve into the observed hadrons only after leaving the 
fireball medium [176], their ensuing “fragmentation” to hadrons (which is known 
from p + p jet physics) results in several hadrons usually comprising a “leading” 
hadron which transports a major fraction (z) = (E BE P) of the fragmenting parton 
energy EP, which, in turn, equals EP (primordial)—A E, with AE sampled from 
a probability distribution with mean (A E) according to Eq. (7.61). The observed 
leading hadron energy or transverse momentum is thus subject to sampling, both, 
z from the fragmentation function, and AF from in-medium energy loss. Finally, 
inclusive high pr leading hadron observation in A+A collisions involves an average 
over all potential initial parton production points, within the primordially produced 
density profile. A specific distribution of in medium path lengths f(L) arises, for 
each such production point, which, moreover, depends on a model of space-time 
fireball expansion. The final inclusive yield thus requires a further, weighted volume 
average over f(L) per production point. Thus, typical of an inclusive mode of 
observation, the “ideal” relationship of Eq. (7.61), between radiative in-medium 
energy loss AE and traversed path length L gets shrouded by double averages, 
independently occurring at either side of the equation [176, 179, 189, 194—196]. A 
detailed L? law verification cannot be expected from inclusive central collision data 
alone (see next section). 

However, the unmistakably clear signal of a strong, in-medium high pr parton 
quenching effect, gathered at RHIC by Raa measurement for a multitude of 
hadronic species (Figs. 7.20, 7.41, 7.42, 7.43, 7.45, and 7.47), in Au+Au collisions 
at ./s = 200 GeV, has resulted in first estimates of the transport coefficient 4, 
the medium—specific quantity entering Eq. (7.61), in addition to the geometry— 
specific path length L. In fact, the transport coefficient can, to some extent, be 
analyzed independently, owing to the fact that g œ o from Eq. (7.60). The density 
Q falls down rapidly during expansion, but it is initially rather well constrained 
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by the conditions of one-dimensional Bjorken expansion that we have described 
in Sects. 7.2 and 7.4. The major contribution to partonic AE arises in the early 
expansion phase (via a high à), in close analogy to the formation of the elliptic flow 
signal. These two signals are, thus, closely correlated: the primordial hydrodynamic 
expansion phase of bulk matter evolution sets the stage for the attenuation, during 
this stage of QCD matter, of primordially produced “tracer” partons, traversing the 
bulk matter medium as test particles. 

The bias in partonic A E to the primordial expansion period is borne out in an 
expression [193, 195] which replaces the 4 coefficient, appropriate to an infinitely 
extended static medium considered in the original BDMPSZ model, by an effective, 
expansion time averaged 


L 
onzi de-máo (7.62) 

to 
to be employed in the realistic case of an expanding fireball source. Due to the 
rapid fall-off of o, ing = oo CA from Eq. (7.60), the integral depends, far more 
strongly, on g (f ~ fo) than on total path length L. Furthermore, inserting der into 
the BDMPSZ formula [193, 195] for the transverse momentum downward shift, 
occurring in leading parton or hadron pr spectra (of power law form prz", see 


Fig. 7.20) 
A pr A —d5,/ mL? pr/v, (7.63) 


we see that the first order proportionality to L? is removed. The downward pr shift 
is thus, primarily, a consequence of Gere which, in turn, is biased to reflect the “ideal” 
transport coefficient g at early evolution time. Within this terminology, the pr 
shift (see Eq. (7.63)) determines the experimentally measured ratio Raa (pr) which 
quantifies the effective transport coefficient Gere for the pr domain considered. It 
can be related, as a cross check, to the initial gluon rapidity density if the collision 
region expands according to Bjorken scaling [187, 197]: 

G = as 2g? a (7.64) 
A typical result of application of the model described above [195] is shown in 
Fig. 7.48. Analogous to Fig. 7.41, Raa for neutral pions and charged hadrons is 
averaged over the range 4.5 < pr < 10 GeV/c, and shown as a function of 
centrality (assessed by Npart) in minimum bias Au+Au collision at /s = 200 GeV 
[190, 191, 198]. A path-averaged q.s of 14 GeV?/fm is inferred from the fit, in close 
agreement to the value found in [195]. 

A more recent study [199] of the PHENIX Raa data for x? in central Au+Au 
collisions (Fig. 7.42) is shown in Fig. 7.49. The analysis is carried out in the frame- 
work of the WHDG model [200], which replaces the (effective) transport coefficient 
q (employed in the BDMPSZ model [186-188], and turned into the data analysis 
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Fig. 7.48 The effective 
transport coefficient 

ĝ = 14 GeV?/fm in the 
parton quenching model 
(POM) of [195] determined 
from the centrality 
dependence of RAA for x? 
and charged hadrons, 
averaged over 4.5 

< pr < 10 GeV/c, in AucAu 
at /s = 200 GeV 


Fig. 7.49 Application of the 
WDRG transport model [200] 
based on Eq. (7.64) to 
PHENIX Raa data for zx? 
[175], indicating primordial 
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formalism of [193, 195]) by the primordial gluon mid-rapidity density dN? /dy, as 
the fundamental parameter, via Eq. (7.64). The initial gluon density, in turn, being 
related to the charged hadron mid-rapidity density [61]. Figure 7.49 shows that, 
within the still preliminary statistics at pr > 10 GeV/c, the “conservative” estimate 
of as = 0.3 and dN /dy = 1000 does not appear to be the most appropriate choice, 
the data rather requiring 1000 < dN&/dy < 2000. Overall, Fig. 7.49 demonstrates 
a certain insensitivity of the data, to the basic parameters of theoretical high pr 
quenching models of inclusive hadron production at RHIC energy, that we have 
already inferred from Fig. 7.43, concerning choice of q. 

Radiative in-medium energy loss of primordially produced partons, traversing 
the evolving bulk medium as “tracers”, must be extremely strong. This is indicated 
by the inclusive attenuation ratios RAA, which fall down to about 0.2 in central colli- 
sions thus almost reaching the absolute lower limit of about 0.15 that arises from the 
unavoidable fraction of un-attenuated primordial surface “corona” nucleon-nucleon 
interaction products [201]. 
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Not expected from early estimates of medium opacity based on the picture of a 
weakly coupled perturbative QCD medium [172, 202], the interior sections of the 
central Au+Au interaction volume must be almost “black” toward high pr parton 
transport [194, 195] at ./s = 200 GeV, also including charm quark propagation 
(Fig. 7.43). The remaining signal should thus stem, primarily, from the dilute surface 
sections, and from the finite fraction of partons traversing the interior with small, or 
Zero radiative energy loss, as a consequence of the finite width of the A E probability 
distribution [193, 194]. Seen in this light, the smooth decrease of RAA with centrality 
(Fig. 7.48) should reflect the combined effects, of a decreasing surface to volume 
ratio, an increasing effective g (due to interior density increase) that confronts the 
increasing average geometrical potential path length (L) (essentially enhancing its 
effect), and a thus diminishing fraction of primordial high pr partons experiencing 
asmall AE. 

Not surprisingly, the ideal non abelian QCD relationship of AE proportional 
to in-medium high pr parton path length L? can, thus, not be established from 
inclusive high py quenching data alone. We shall show in the next section that di-jet 
primordial production can offer a mechanism of higher resolution. The inclusive 
Raa attenuation data, obtained at RHIC, are seen to establish an unexpected, 
high opacity of the primordial interaction volume, extending to high pr parton 
propagation. The required, high transport coefficient G = 14 GeV?/fm from 
Fig. 7.48, confirms and extends the picture derived from elliptic flow data [167]: 
at top RHIC energy the plasma is non-perturbatively, strongly coupled, a new 
challenge to lattice QCD [61]. The QGP may be largely composed of colored, string- 
like partonic aggregates [203]. 


7.5.3 Di-jet Production and Attenuation in A+A Collisions 


In order to analyze leading parton attenuation in a more constrained situation [204], 
one investigates parton tracer attenuation under further geometrical constraints 
concerning the in-medium path length L, by means of di-jet analysis, and/or by 
varying the primordial parton density that enters g via Eq. (7.60) in studies at 
different ./s while maintaining the observational geometrical constraints. 

We shall concentrate here on di-jet attenuation data obtained in Au+Au collisions 
at top RHIC energy, ys = 200 GeV. At this relatively modest energy the initial 
pQCD production cross section of leading partons (as described in Eq. (7.51)) 
reaches up to pr = 25 GeV/c. The ensuing DGLAP shower multiplication initiates 
“parton fragmentation” to hadrons [83, 85, 121], each carrying a momentum fraction 
ZT = pt/pr (primord. parton). The created ensemble of hadrons h belonging to the 
observed hadronic jet can be wrapped up by the total fragmentation function 


d 
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which summarizes the contributions arising from the different shower partons i. 
Here, C; are the weight coefficients of the particular process, and Dpart(i)+n are the 
individual fragmentation functions (FFs) for turning parton i into hadron A. Similar 
to the parton distribution functions (PDFs) in Eq. (7.51), derived from deep inelastic 
electron-parton scattering (DIS) and shown in Fig. 7.14, the FFs are semi-empirical 
non-perturbative QCD functions that have an intuitive probabilistic interpretation. 
They quantify the probability that the primordial parton produced at short distance 
1/Q fragments into i shower partons, forming a jet that includes the hadron h [205, 
206]. 

At Fermilab energy, ./s = 1.8 TeV, the jet spectrum reaches up to Ep © 
400 GeV, and a typical 100 GeV jet comprises about 10 hadrons which can 
be identified above background by jet-cone reconstruction algorithms [205]. This 
allows for a complete determination of the corresponding fragmentation function, 
and for a rather accurate reconstruction of the py and Er of the primordial parton 
that initiated the jet. Similar conditions will prevail in jet spectroscopy of Pb+Pb 
collisions at LHC energy, ./s = 5.5 TeV. 

However, at RHIC energy a typical jet at 15 < Ey < 25 GeV features a 
fragmentation function comprised of a few hadrons with ET in the 2-15 GeV range. 
Considering the high background, arising in the lower fraction of this energy domain 
from concurrent, unrelated high pr hadron production processes, a complete jet- 
cone analysis cannot succeed. The RHIC experiments thus confront back-to-back 
di-jet production with an analysis of the azimuthal correlation between high pr 
hadrons. Defining the observational geometry by selecting a high pr “trigger” 
hadron observed at azimuthal emission angle quig, the associated production of 
high pr hadrons is inspected as a function of Ag = ass — Quig. If the trigger 
has caught a leading jet hadron one expects the hadrons of the balancing back-to- 
back jet to occur at the side opposite to the trigger, Ag ~ zr. The trigger condition 
thus imposes the definition of a “near-side” and an “away side" azimuthal domain. 
Furthermore, the relatively narrow rapidity acceptance of the STAR and PHENIX 
experiments (centered at y = 0) selects di-jets with axis perpendicular to the beam 
direction. 

Originating from a uniform distribution of primordial back-to-back di-parton 
production vertices, throughout the primordial reaction volume, the trigger selected 
di-jet partons thus experience an (anti-)correlated average path length (L) to arrive 
at the surface while experiencing medium-specific attenuation, with (Lais) x 2R-— 
(Laway), R being the transverse medium radius. No such geometric constraint exists 
in the study of inclusive high pr hadron production. We thus expect information 
different from the inclusive Raa (pr) signal. The geometrical selectivity can be 
even further constrained by fixing the direction of the impact parameter (i.e. the 
reaction plane) in semi-central collisions (recall Sect. 7.4), and observing the di-jet 
correlation signal in dependence of the di-jet axis orientation relative to the reaction 
plane. 

The very first di-hadron correlation measurements confirmed the existence of 
strong in-medium attenuation. Figure 7.50 shows the azimuthal yield distributions, 
per trigger hadron, as observed by STAR at ./s = 200 GeV [207]. The left panel 
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Fig. 7.50 Di-hadron correlation from back-to-back di-jet production in Au+Au collisions at ./s = 
200 GeV. The trigger particle is at azimuth  — 0, with a pr > 4 GeV threshold. The away side 
peak at A® = v is observed (left panel) in p+p, d+A but absent in central Auc Au. Right panel 
shows the correlation in Au Au for different orientations of the trigger direction relative to the 
reaction plane [207] 


shows the distribution of hadrons with pr > 2 GeV/c relative to a trigger hadron 
with m > 4 GeV/c. Data for p+p, d+Au and central Au+Au are illustrated. At 
the “near side” (the trigger position is ® = 0) all three reactions exhibit a similar 
narrow distribution of hadrons associated with the trigger hadron, typical of a jet 
cone fragmentation mechanism. Note that the associated near-side central Au+Au 
signal thus exhibits no signs of an attenuation softened fragmentation function, 
indicating that the trigger imposed high pr hadron should predominantly stem from 
primordial jet production vertex points located near to the surface of the reaction 
volume, in azimuthal trigger direction. Thus, conversely, the balancing opposite 
jet has to develop while traversing almost the entire transverse diameter of the 
interaction volume. I.e. (Loppos) ~ 2 R thus emphasizing the expectation that di-jet 
spectroscopy should allow for stricter constraints on path length L in comparison 
to single high pr hadron Raa analysis. In fact, no trigger related away side signal 
of pr > 2 GeV/c hadrons is observed in Fig. 7.50 for central Au+Au collisions, 
whereas p+p and central d+Au collisions exhibit a clear away-side di-jet signal. 

We conclude that the trigger bias, selecting a single near side hadron of pr > 
4 GeV/c in central Au+Au collisions, responds to a primordial di-jet of about 
10 GeV per back-to-back parton. After traversal of in medium average path length 
L — 2 R the fragmentation function of the opposite side parton contains on 
average no hadron at pr > 2 GeV/c, indicating that it should have lost a fraction 
(AET) > 5 GeV. The medium is thus highly opaque, but the total disappearance 
of the opposite side signal can only provide for a lower limit estimate of (A ET), 
within the trigger conditions employed here. We shall show below that the situation 
changes with more recent RHIC data [208] that extend the trigger hadron pr range 
toward 20 GeV/c. 

However, the right hand panel of Fig. 7.50 shows that an improved constraint 
on partonic in-medium path length can already be obtained by studying the di-jet 
back-to-back production geometry in correlation with the orientation of the reaction 
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plane that arises from non-zero impact parameter in semi-central A+A collisions. 
We have seen in Fig.7.34 that such collisions exhibit an elliptical primordial 
transverse density profile, with minor axis along the impact vector b, defining the 
reaction plane. Di-jets with axis “in-plane” thus traverse a shorter in-medium path 
length as orthogonal “out-of-plane” jets, the difference in average path length being 
quantified by the spatial excentricity e(b), Eq. (7.47). Figure 7.50 (right) shows the 
di-hadron correlation results in semi-peripheral Au+Au collisions, as compared to 
the in-vacuum p+p reference. At the 20-60% centrality window employed here, out 
of plane jet emission occurs along the major axis, the reaction volume diameter 
still of magnitude 2R, as in central collisions. The trigger condition thus again 
selects opposite side path lengths L — 2 R, but the energy density should be 
lower than in central collisions. Even so, the average opacity along the away side 
parton path appears to be high enough to wipe out the correlation signal. In-plane 
geometry, however, shows a partially attenuated signal (as compared to the global 
ptp reference) at the opposite side, corresponding to path lengths L ~ R. These 
data thus provide for first information concerning the relation of (A E) and average 
traversed path length [176]. 

Obviously, one wants to extend the above study to higher di-jet energies, i.e. 
to measurement of di-hadron correlations at hadron trigger pr — 20 GeV/c, 
conditions sampling the very low primordial cross section, at ./s = 200 GeV, 
of jet production at primordial Ey — 30 GeV [208, 209]. Figure 7.51 shows the 
corresponding jet correlations selected with high pr trigger, 8 < p" < 15 GeV/c, 
and high pr associated hadrons, pr > 6 GeV/c, in minimum bias d+Au, semi- 
central Au+Au and central Aut+Au at ./s = 200 GeV. Very clear and almost 
background-free back-to-back jets are observed in all three cases, in sharp contrast 
with the away side jet total disappearance in central Au+Au at lower jet energy, 
Fig. 7.50 (left panel). The near side trigger associated hadron yield decreases only 
slightly from d+A to central Au+Au, while the away side yield drops down by an 
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Fig. 7.51 Di-hadron correlation at high pr in central Au+Au collisions at ys = 200 GeV, 


trig 


compared to d+Au and peripheral Au+Au; for 8 < py” < 15 GeV, and p7*°° > 6 GeV. From 
[208] 
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attenuation factor of about 0.2, the signal thus not being completely extinguished. 
We infer RAA(L — 2R) © 0.2. 

In order to show how such data can be evaluated in a picture of in-medium lead- 
ing parton attenuation (as finally reflected in leading hadron production observed 
in the above di-hadron correlation data) we briefly consult the pQCD factorization 
[210] prediction for the inclusive production of a high pr hadron at central rapidity, 
in the nuclear collision A + B > h + x [196], 


d^cAp.. n 4 z 
— —. = Kmo a dr dx, dxp dze FajA(xa, Q?, T) 


d? pr dy E 
x Fyjp p, Q^, b — TENEN Q?) 
d Prio dye 
1 2 
*z Dnjc(zc, Q”) (7.66) 


where the parton (a, b) distribution functions F in nucleus A, B and the elementary 
pQCD cross section for a+b — c+ x have been already implied in Eq. (7.51). Their 
spatial integral gets convoluted with the fragmentation function D that describes the 
conversion of the leading parton c to a hadron carrying a fraction 0 < Ze < 1 of its 
transverse momentum. K is a factor introduced as a phenomenological correction 
for “next to leading order" (NLO) QCD effects. Within the (further) approximation 
that the leading parton c suffers medium induced gluon bremsstrahlung energy 
loss but hadronizes outside the interaction volume (in vacuum), the in-medium 
quenching leads, merely, to a re-scaling of the fragmentation function, 


Dra = f de Po— Die (, zr - o?) , (7.67) 
where the primary parton is implied to lose an energy fraction e = AE/E. with 
probability P (e) [196]. Therefore the leading hadron is a fragment of a parton with 
reduced energy (1 — €) Ec, and accordingly must carry a larger fraction of the parton 
energy, zc/(1 — €). If no final state quenching is considered, P (€) reduces to 6(€). 
The entire effect of medium attenuation on the leading parton is thus contained in 
the shift of the fragmentation function. 

An application of this formalism [196] is shown in Fig. 7.52. The in-medium 
modification of the hadron-triggered fragmentation function (see Eq. (7.67)) is 
evaluated for central Au+Au collisions at ./s = 200 GeV. In adaptation to the 
modalities of RHIC di-hadron correlation data, the opposite side fragmentation 
function (for observation of trigger-related hadrons with pr > 2 GeV/c) is studied 
in dependence of the trigger selected pr window. Its attenuation is quantified by the 
ratio D (with quenching) to D (without quenching), as a function of the fraction 
zT. of opposite side hadron pr to trigger hadron pr. Referring to the observational 

trig 


conditions implied in Fig. 7.50 (left) p. ^ ~ 4-6 GeV/c and opposite side pr > 
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2 GeV, the predicted suppression amounts to a factor of about 0.35, whereas the 
data imply a factor smaller than 0.2. Likewise, comparing to the data of Fig. 7.51, 
the (harder) trigger conditions should lead to a suppression of about 0.45 from 
Fig. 7.52 but the observed value is close to 0.2. The predictions appear to fall short 
of the actually observed suppression. This calculation employs an ansatz for the 
transport coefficient ĝ similar to Eq. (7.64), recurring to the primordial gluon density 
at to < 0.6 fm/c which, in turn, is estimated by the charged hadron mid-rapidity 
density [197]. However, this pQCD based argument can also not reproduce the 
magnitude of the effective transport coefficient (Eq. (7.62)), Gert ~ 10-15 GeV?/fm, 
shown in refs. [194, 195, 199] to be required by the large observed suppression (see 
Fig. 7.43). 

It has been argued [199, 211] that these models need refinement by introducing 
more realistic dynamics, and/or by completely abandoning the pQCD ansatz 
for hard parton in-medium transport [212, 213]. We shall return to these novel 
suggestions, of how to treat dynamical, non equilibrium quantities of the “strongly 
coupled" parton plasma of non perturbative QCD (toward which equilibrium lattice 
theory can only give hints), in our final conclusion (Sect. 7.8). In the meanwhile, 
we note that the expected non abelian behavior, AE œ L?, could not be verified 
quantitatively, as of yet [211], because of the unexpectedly high opacity of the 
fireball interior sections at ./s = 200 GeV, in combination with the limited jet 
energy range that is available at RHIC energy. It appears possible, however, to 
extend the analysis of the “back side jet re-appearance" data [208, 209] (Fig. 7.51) 
toward this goal. The situation should improve at LHC energy, J/s = 5.5 TeV, 
where primordial 100-200 GeV jets are abundant, such that the opposite side jet 
can be reconstructed with explicit use of the complete Fermilab jet cone recognition 
algorithms [205] even if their in-medium E7 loss ranges up to 50 GeV. 
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Fig. 7.53 Di-hadron correlation: away side emission pattern in central Au+Au collisions, com- 
pared to pp data by STAR [214] (left panel) and to peripheral Au+Au (right panel) by PHENIX 
[215] 


A further prediction of the model employed in Fig. 7.52 has been confirmed 
by the RHIC experiments. The high medium opacity at top RHIC energy leads 
to an intriguing emission pattern of low pr opposite side hadrons. Clearly, the 
trigger-selected ET flux, of up to 20 GeV, toward the away-side, cannot remain 
unnoticeable. Inspection of the attenuated fragmentation functions [196] in Fig. 7.52 
reveals an enhanced emission of bremsstrahlung gluon hadronization products at 
zr < 0.1 This fraction of in-medium jet-degradation products has in fact been 
observed, as is shown in Fig. 7.53. The left panel shows STAR results [214] for 
the di-hadron correlation in central Au+Au at ys = 200 GeV, with near-side 
hadron trigger 4 < pr < 6 GeV/c, and opposite side observation extended to soft 
hadrons, 0.15 < pr < 4 GeV/c. A prominent double-peak structure is indicated, 
symmetric about A® = x. The right panel shows high resolution PHENIX results 
[215] for Au+Au at three centralities, from peripheral to central collisions. For the 
former, the typical pt+p-like away side peak (c.f. Fig. 7.47) is recovered, while a 
double peak appears in the latter case, shifted from Ad = 2 by +5@ ~ 70°. A 
hypothetical mechanism comes to mind [213], of sideward matter acceleration in 
a “Mach-cone” directed mechanism of compressional shock waves initiated by the 
in-medium energy loss of the opposite side leading jet parton, which traverses the 
medium at “super-sonic” velocity, i.e. at v > vs, the appropriate speed of sound in 
a parton plasma. 

If confirmed by pending studies of the away-side multi-hadron correlation, that 
might ascertain the implied conical shape of the soft hadron emission pattern (about 
the direction A® ~ 7 of the leading parton), this mechanism might lead to the 
determination of the sound (or shock wave) velocity of a strongly coupled parton 
plasma: a third characteristic QGP matter property, in addition to viscosity n (from 
elliptic flow) and 4 (from high pr parton attenuation). We note that the implied 
concept, of measuring the shock wave transport velocity of a strongly interacting 
medium, dates back to the 1959 idea of Glassgold et al. [216], to study shock wave 
emission in central p+A collisions at AGS and PS proton energy, of about 30 GeV. 
In hindsight we can understand the inconclusiveness of such searches: the “low 
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energy" incident proton does not preserve its identity, breaking up into relatively 
soft partons in the course of traversing the target nucleus. Their coupling to the 
surrounding cold baryonic target matter is weak, and possible collective transverse 
compressional waves are swiftly dissipated by the cold, highly viscous [61] hadronic 
medium. The shock compression and Mach-cone emission mechanism was then 
revived by Greiner et al. [217] for central "Ne + U collisions at Bevalac energy 
[218]. The recent RHIC observations in Fig. 7.53 have been called "conical flow" 
[219]; they demonstrate, again, the strong coupling to the medium, of even a 
20 GeV/c leading parton [220]. 


7.6 Vector Meson and Direct Photon Production: 
Penetrating Probes 


This chapter is devoted to three observables that have been profoundly studied 
already at the SPS, as tracers of the created fireball medium: 


1. J/ V “charmonium” production, as suppressed in a high T QGP medium, 
2. "direct" photons as black body T sensors of the QGP state; 
3. p meson in medium production, studied by di-lepton decay. 


These three observables represent an internally connected set of ideas of high 
density QCD matter diagnostics: all serve as medium tracers but in a complementary 
way. Matsui and Satz [41] realized that at the modest top SPS energy, 17.3 < 
A/s < 20 GeV, the production rate of cc pairs (that would in part develop toward 
charmonium production, J/ V, Y’, etc.) was so low that only the primordial, first 
generation nucleon-nucleon collisions in an A+A event would create a measurable 
yield (nothing after thermalization). Le. the primordial cc yield would be well- 
estimated by A4/3 Opp (cc). Initially produced in a superposition of states including 
color triplets and octets [221] the emerging cc pairs thus co-travel with the 
developing high energy density fireball medium, as tracers, on their way toward 
J/ or V' and D, D formation. Attenuation, characteristic of medium properties, 
will break up leading cc pairs resulting in a suppression of the eventually observed 
J/W yield: another “quenching” observable. 

As the suppression of J/y is related to medium temperature and density, the 
extent of charmonium quenching should also be related to the rate of black body 
thermal fireball radiation, by photon emission via elementary gg — gy and 
qg — qy processes in the plasma [222, 223]. Photons leave the interaction volume 
essentially un-rescattered, and their radiative intensity, proportional to T^, makes 
them an ideal probe of the initial fireball temperature. This thus could be an ideal 
diagnostics of the early deconfined matter, but it is difficult to disentangle from a 
multitude of concurrent low pr photon sources [224], most prominently x? > 2y 
decay, with cross sections higher by several orders of magnitude. The thermal 
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photon signal thus becomes more promising the higher the initial temperature 7; 
which might reach up to 500-600 MeV at LHC energy. 

Similar to photons, in medium created lepton pairs [225] escape essentially un- 
attenuated as was shown for the Drell-Yan process, gg — LL by CERN experiment 
NA38/NA50 [226]. Thermal di-lepton production in the mass region < 1 GeV is 
largely mediated by light vector mesons. Among these, the p meson is of particular 
interest due to its short lifetime (1.3 fm/c), making it an ideal tracer (via its in- 
medium di-lepton decay) for the modification of hadrons composed of light quarks, 
in the vicinity of T = Te. This modification signal is thus complementary to the 
heavy quark charmonium J / V suppression (break up) effect that sets in at T > 1.5— 
2T. (see below). Moreover, in addition to the deconfinement breakup mechanism 
by QCD plasma Debye screening of the color force potential acting on the cc pair 
[41], the QCD chiral symmetry restoration mechanism [227] can be studied via 
in-medium modification of the o spectral function as T — T,. Note that the in 
vacuum p mass and width properties are owed to non-perturbative QCD condensate 
structures [1, 228] which spontaneously break the chiral symmetry of the QCD 
Lagrangian, at T — 0. These properties should change, in the vicinity of Tę, and be 
reflected in modifications of the di-electron or di-myon decay spectra—unlike the 
suppression effect on the J/ V which simply dissolves primordial cc pairs before 
they can hadronize, a yes-no-effect whose onset with ./s or centrality serves as a 
plasma thermometer, by observing Raa or Rcp < 1. 


7.6.1] Charmonium Suppression 


Due to the high charm and bottom quark masses, the “quarkonium” states of cc and 
bb can be described in non-relativistic potential theory [229, 230], using 


Vise (7.68) 
T 


as the confining potential [231], with string tension o. = 0.2 GeV? and gauge 
coupling a = 2/12. We are interested in the states J/W (3.097), Xe (3.53) and 
W’ (3.685) which are the 1S, 1P and 2S levels. The decay of the latter two feeds into 
the J/ V, accounting for about 40% of its yield. The radii derived from Eq. (7.68) 
are 0.25, 0.36 and 0.45 fm, respectively, well below hadron size at least for the J / Y 
and x. states. 

With increasing temperature, o (T) decreases, and at deconfinement o (Te) = 0. 
For T > T, we thus expect 


V(r) = - exp [—r/rp (T)] (7.69) 


where rp(T) is the QCD Debye screening radius. It was initially estimated from 
a SU (2) gauge theory of thermal gluons [41], to amount to about 0.2-0.3 fm at 
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T/T, = 1.5. In this picture, the screened potential (Eq. (7.69)) can still give rise 
to bound cc states provided their radius is smaller than rp. The pioneering study 
of Matsui and Satz [41] concluded that screening in a QGP system would dissolve 
the J/ V, or its cc precursor, at T > 1.3 Tc whereas the xc and Y’ states would be 
suppressed already directly above Te. 

The corresponding energy density for J/ Y suppression, employing 


efec ~ (T/ T.)^ ~ 2.9 (7.70) 


(obtained with e; œ% 1 GeV/fm? from lattice QCD), would thus amount to about 
2.9 GeV/fm?. This motivated an extensive experimental effort at the CERN SPS Pb 
beam at ./s = 17.3 GeV. We have seen in Sect. 7.2 that the Bjorken estimate [45] 
of average transverse energy density reached in central Pb+Pb collisions [43, 44] 
amounts to e = (3.0 + 0.6) GeV/fm?, with higher € to be expected in the interior 
fireball sections: encouraging conditions. 

However, the above approach assumes the validity of a two-body potential 
treatment at finite T, near a conjectured critical point of QCD. More recently the 
quarkonium spectrum was calculated directly in finite temperature lattice QCD 
[232], with the striking result that the J/W dissociation temperature in a realistic 
non-perturbative treatment of the QCD plasma state moves up to about T = 2 Ty, 
whereas x, and V/' dissociation is expected [230] to occur at T = (1.1—1.2)7;. 

In addition to high T breakup of cc or J/ V, we have to take account of the 
so-called “normal suppression" of charmonium yields, observed in proton-nucleus 
collisions [233]. This effect is due to a re-scattering dissociation of the primordially 
produced, pre-hadronic cc system upon traversal of (cold) hadronic matter [234]. It 
can be studied in p+A collisions where the data on J/ V production relative to pp 
collisions can be described by the survival probability 


Spa = = [eo [oc pad, z) exp {—(A - Df dz’ pab, zou 
: (7.71) 


where oap; is the effective cross section for the “absorption” (break-up) of the cc in 
cold nuclear matter, and p4 is the transverse nuclear density profile. The data [233] 
suggest Ogbs = 4.2 mb. The generalization of Eq. (7.71) to the nucleus-nucleus 
case [235] gives a good description of the J/ V suppression (relative to binary pp 
scaling) in S+U and peripheral Pb+Pb collisions at top SPS energy [226]. It has thus 
become customary to quantify the J/ V suppression in central A+A collisions by 
relating the observed yield, not directly to the scaled pp yield (thus obtaining Raa), 
but to a hypothetical “normal absorption" yield baseline, established by Eq. (7.71). 
All further absorption is called “anomalous suppression". 

Figure 7.54 shows the results gathered at ./s = 17.3 GeV by the NA38- 
NAS50-NA60 di-muon spectrometer [236], for minimum bias S+U (./s = 20 GeV), 
In+In and Pb+Pb. Up to Npat ^ 100 all yields gather at the “normal absorption” 
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Fig. 7.54 J/W production 
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expectation from p+A scaling. A plateau at 0.8 follows for intermediate Npar values 
up to about 200 (which corresponds to central In+In collisions, so the NA60 data 
end here), and a final falloff toward 0.6 for central Pb+Pb collisions. It appears 
natural to interpret the former step as an indication of V' suppression, the final step 
as xc dissociation. No genuine J / V suppression is indicated. The expectation from 
lattice calculations [232] that J/ V dissociation does not occur until T ~ 2 Te (and, 
thus, € & 16 €c) is thus compatible with Fig. 7.54. We know from Eq. (7.70) that 
T < 1.3 T; at top SPS energy. The data are thus compatible with no break-up of the 
J/ V at the SPS, unlike at top RHIC energy where one expects T ~ 2T, [61, 96]. 

The RHIC data obtained by PHENIX [237] are shown in Fig. 7.55. Minimum 
bias Au+Au collisions are covered at mid-rapidity, as well as at 1.2 « y < 2.2, 
and plotted vs. Npart. Due to a parallel measurement of J/ production in p+p 
collisions [238] the PHENIX experiment is in the position to show Raa, but this 
is done without re-normalization to p-A absorption. J/ WV is suppressed in central 
collisions, Raa < 0.2. Note that Raa cannot drop down below about 0.15, due 
to unsuppressed surface contributions. The suppression is thus stronger than at top 
SPS energy?—in fact it is almost maximal. We conclude that in central Au+Au at 
A/s — 200 GeV the charmonium signal gets significantly quenched, in accord with 
the inferences about the primordial temperature that we presented in Sects. 7.2.5 
and 7.4, to amount to about 300 MeV, i.e. T / Te ~ 2 as implied for J / V dissociation 
by the lattice calculations [232]. 


3Dropping the unfortunate distinction between normal and anomalous absorption one gets Raa = 
0.35 for the central Pb+Pb collisions in Fig. 7.54, almost a factor 2 above the RHIC value. 
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The above interpretation is still a matter of controversy [239]. It remains unclear 
whether successive, distinct stages of normal and anomalous J/W suppression are 
compatible with the dynamical evolution in central A+A collisions. A further open 
question refers to the difference in Fig. 7.55 of the (y) = 0 and (y) = 1.7 data at 
intermediate centrality [239] interpretation of which should be suspended until the 
advent of higher statistics data. 

A different aspect of charmonium production in A+A collisions requires atten- 
tion: at the stage of universal hadronization a certain fraction of bound cc mesons 
results, as the outcome of the density of uncorrelated c and c quarks in the QGP 
medium as it enters hadronization [240, 241]. The stage of statistical hadronization 
(recall Sect. 7.3) is omitted in the charmonium suppression models [41, 230, 234, 
235], which proceed in two steps (only): primordial nucleon-nucleon collisions 
produce an initial input of cc pairs, proportional to 51 x NAA (b). In vacuum, 
the low relative momentum fraction of these cc pairs (about 1%) would evolve into 
charmonium hadrons, after a formation time of several fm/c. In the concurrently 
developing QGP medium, however, the initial cc correlated pairs may dissociate 
owing to the absence (due to color screening [41]) of vacuum-like bound states 
[232], at high T. At T > 2 T; all fireball charmonium production would thus cease, 
all cc pairs ending up in singly charmed hadrons. This picture [230] is incomplete. 

Even if all cc pairs break up at RHIC during the early phase of central collisions, 
the single charm and anti-charm quarks flow along in the expanding medium of 
equilibrated light and strange quarks. This picture is supported [94] by the observed 
elliptic flow of charm merging into the universal trend. Charm cannot, however, 
be chemically (flavor) equilibrated at the low plasma temperature, where it is 
essentially neither newly created nor annihilated [241]. The initially produced c and 
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c quarks (a few per central Au+Au collision at RHIC, a few tens at LHC) finally 
undergo statistical hadronization at T = T, along with all other light and strange 
quarks. To deal with the non-equilibrium overall charm abundance an extra charm 
fugacity factor ye is introduced into the statistical model [108] calculation (for 
details see [241]). J/ XV and W’ are thus created in non-perturbative hadronization, 
with multiplicities proportional to y and phase space weights, along with all other 
charmed hadrons. This "regeneration" model also agrees with the RHIC data of 
Fig. 7.55, albeit within a large systematic uncertainty [241 ]. 

We note that the term regeneration is, in fact, misleading. The statistical 
hadronization process does not recover the initial, small fraction of correlated cc 
pairs that would end up in J/W in vacuum. It arises from the total density of 
primordially produced c and c, uncorrelated in the hadronizing fireball volume. 

The statistical hadronization J/ production process, sketched above, thus 
has the unfortunate property of providing a trivial background charmonium yield, 
unrelated to the deconfinement signal [41] referring to the primordial J/W yield. 
Only about 1% of the primordial cc yield results in charmonia, in vacuum. The 
in-medium deconfinement process breaking up the cc correlation on its way to 
charmonia, thus constitutes a mere 1% fraction of the total charmed quark and 
anti-quark number. The regeneration process is insensitive to this 196 fraction, 
deconfined or not. At 7;, charm hadronization reacts only to the total abundance 
of c and c, as imprinted into the dynamical evolution by the perturbative QCD cc 
production rate of initial nucleon-nucleon collisions. At RHIC, it turns out [241] that 
the c and c density is low, giving rise to substantial canonical suppression (recalling 
Eqs (7.38)-(7.42) in Sect. 7.3) of the two charm quark charmonia, relative to D 
mesons, during hadronization. With a tenfold c, c density at LHC, grand canonical 
charmonium production will set in, thus probably overshooting the primordial yield 
reference, onl x Neo. Thus we expect Raa > 1 at the LHC. The role of a critical 
deconfinement “thermometer” is lost for J/ Y at LHC, but the bottonium Y states 
can take over, being deconfined well above T = 300 MeV [242]. 

The RHIC result [237] for J/ V in central Au+Au collisions (Fig. 7.55), namely 
that Raa — 0.2, represents the lucky coincidence that the initial temperature, T ~ 
300 MeV, is high enough to dissolve the correlated cc charmonium precursor states, 
while the J/W suppression is not yet overshadowed by the trivial hadronization 
yield of J/W. 


7.6.2 Direct Photons 


Photons are produced during all stages of the dynamical evolution in A+A colli- 
sions. About 98% stem from final electromagnetic hadron decays, not of interest 
in the present context, other then by noting that their rate has to be painstakingly 
measured experimentally, in order to obtain “direct” photon spectra at low pr by 
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subtracting the decay fraction from the total. This was done [224] by WA98 in 
central Pb--Pb collisions at top SPS energy; we show their result in Fig. 7.56. 

Only upper limits could be obtained at pr < 1.5 GeV/c due to overwhelming 
background from 7? and n decay, thus also obscuring the major spectral domain 
in which to look for a direct photon QCD plasma black body radiation source, 
i.e. for thermal photons of a fireball at T between T, and about 250 MeV [223]. 
Several data from p+p and p+A collisions at nearby ./s and scaled up to central 
Pb+Pb are included in Fig. 7.56, at pr > 2 GeV/c, and one sees the Pb+Pb data 
in clear excess of such contributions from primordial bremsstrahlung and hard, 
pQCD initial partonic collisions [243]. This excess ranges up to about 3.5 GeV/c, in 
Fig. 7.56. Above, the hard pQCD initial collision yield dominates [244] over thermal 
production. 

In contrast to all other primordial high pr pQCD yields (e.g. J/ V, charm, jet 
leading partons) this photon yield is not attenuated in the medium of A+A collisions. 
We have shown in Fig. 7.44 the Raa for the PHENIX central Au+Au direct photon 
results [181] at RHIC ~s = 200 GeV, obtained in a background substraction 
procedure [245] similar to the one undertaken by WA98. This procedure gives 
reliable data at pp > 4.0 GeV/c, at RHIC, and we observe Raa = 1. Hard 
initial photons are not attenuated, and there is no sign of any other direct photon 
contribution besides the primordial pQCD yield which, in fact, is shown (by Raa = 
1) to obey binary scaling. However, there is no hint to plasma thermal radiation 
(except for a trend at the very limit of statistical significance, at pp < 4.5 GeV/c) in 
this high pr window. 
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The WA98 SPS data, with thermal radiation enhancement indicated in the 
interval 1.5 « pr < 3.5 GeV/c, thus remained as the sole evidence until, 
more recently, the PHENIX experiment gained low pr data [246] exploiting the 
fact that any source of real photons emits also virtual photons y* leading to 
internal conversion to an ete” pair (the Dalitz effect). To identify this yield the 
invariant mass distribution of e^ e^ pairs is analyzed outside the phase space limits 
of 2° Dalitz decay; the decay pairs of all remaining hadron sources (n, A) is 
subtracted as a “cocktail”. The remaining pair yield is then converted assuming 
Vdir/ Vinclusive = Ydir/ Yinclusive (See ref. [246] for detail), thus finally obtaining data 
representative of ydir in this approach. Figure 7.57 shows the corresponding pr 
distribution which covers the interval 1.3 < pr < 4.5 GeV/c, within which 
the conventional direct photon extraction method did not give significant results 
[181]. The PHENIX experiment has also obtained direct photon spectra in p+p 
and d+Au at ./s = 200 GeV [247] which are both well accounted for [246] by 
a next to leading order (NLO) pQCD photon production model [248]. These data 
were already employed in deriving Raa = 1 for central Au+Au collisions, as 
shown in Fig. 7.44 and referred to, above. The pQCD fits derived from p+p and 
d+A are shown in Fig. 7.57 after binary scaling to Au+Au (pQCD x TAA). They 
merge with the yield at pr > 4 GeV/c but demonstrate a large excess yield below 
3 GeV/c. That excess is well described by adding a thermal photon component 
resulting from the hydrodynamic model of d'Enterria and Peressounko [249]. It 
traces the dynamical evolution during the early stages of equilibrium attainment, 
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in which the photon luminosity of the emerging QGP matter phase is maximal. 
The hydrodynamic model provides for the space-time evolution of the local photon 
emission rate [223] which includes hard thermal loop diagrams to all orders, and 
Landau-Migdal-Pomeranchuk (LPM) in-medium interference effects. Similar, in 
outline, to previous models that combined algorithms of plasma photon radiation 
luminosity with hydrodynamic expansion [250], the model [249] fits the data in 
Fig. 7.57. It implies a picture in which the early stage of approach toward thermal 
equilibrium at RHIC is governed by a symbolic, initial, effective "temperature" 
of about 550 MeV which, after equilibration at t ~ 0.6 fm/c, corresponds to 
T ~ 360 MeV in the primordial plasma [249]: close to the consensus about initial 
T as derived from J/W suppression, jet attenuation, elliptic flow and transverse 
energy coupled with the 1-dimensional Bjorken expansion model. 

However, we note that the employed theoretical framework, hard thermal loop 
(HTL) QCD perturbation theory of a weakly coupled plasma state, as combined 
with hydrodynamics, has a tendency to call for rather high initial T values. This 
applies both to the above analysis [249] of RHIC data which is based on the thermal 
field theory model of Arnold, Moore and Yaffe [223], and to previous analysis 
[250] of the WA98 SPS data of Fig. 7.56. Direct photon production is even more 
strongly biased toward the primordial, high T evolution than jet attenuation (that 
may be proportional to T? [212]). Thus, by implication of the model [249] that 
fits the low pr RHIC data of Fig. 7.57, the yield is highly sensitive to the (pre- 
equilibrium) formation period, 0.15 < t < 0.6 fm/c, where the HTL model might 
not be fully applicable. This illustrates the present state of the art. The model(s) 
based on perturbative QCD require extreme initial "temperatures" to produce the 
high photon yield, indicated by the RHIC experiment. The strongly coupled nature 
[213] of the non perturbative local equilibrium QGP state at RHIC, T ~ 300 MeV, 
may provide for an alternative approach to plasma photon production. 


7.6.3 Low Mass Dilepton Spectra: Vector Mesons In-medium 


We have dealt with dilepton spectra throughout the above discussion of J/W and 
direct photon production, as messenger processes sensitive to the energy density 
prevailing (during and) at the end of the primordial equilibration phase. The third 
tracer observable, low mass vector meson dilepton decay in-medium, samples—on 
the contrary—the conditions and modalities of hadron deconfinement in the vicinity 
of T = Ty. SPS energy is ideally suited for such studies as the QGP fireball is 
prepared near T, whereas, at RHIC, it races through the Te domain with developed 
expansion flow. The major relevant data thus stem from the CERN SPS experiments 
NA45 [251] and NA60 [136], which have analyzed e*e^ and u* u^ production at 
invariant mass from 0.2 to 1.4 GeV. 

Figure 7.58 shows NA45 data [251] for e*e^ production in central Pb+Au 
collisions at ys = 17.3 GeV. In searching for modifications of @ properties 
and of mtx annihilation via virtual intermediate o decay to ete~, in the high 
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Fig. 7.58 Di-electron mass spectrum for central Pb+Au collisions at ./s = 17.3 GeV with the 
hadron decay cocktail (left) and further in medium contributions (right) [251]; see text for detail 


density environment near the hadron-parton coexistence line at T = T,, the various 
background sources have to be under firm control. The Dalitz decays of 1°, n 
and z' and the in vacuo e*e~ decays of o, w and 6, which occur after hadronic 
freeze-out to on-shell particles, form a hadronic "cocktail" (left panel in Fig. 7.58) 
that is generated from yields provided by the grand canonical statistical model 
[108]. Within detector resolution, zr, œ and ® leave distinct peaks but the observed 
invariant mass distribution is not accounted for. 

One needs also to account for background from Drell-Yan lepton pair production 
and open charm decay, both scaling with “number of collisions" A^/. The latter 
contribution arises from primordial cc charm production, (c) — (c) which leads to 
synchronous formation of (D) — (D) at hadronization; subsequent decays D — 
lepton + X, D — antilepton + Y create LL pairs. This procedure is straight 
forward as no significant medium attenuation occurs besides the statistical charm 
redistribution conditions at hadronization (Sect. 7.6.1), governing D, D production 
[241]. 

Onward to non-trivial backgrounds in the invariant mass plot of Fig. 7.58, we 
recall the presence of thermal lepton pairs from virtual photon production in the 
early plasma phase, in parallel to real “direct” photon emission [252]. The spectrum 
of such pairs contains an average decay factor exp (Mij/ T), with T the (initial) 
plasma temperature. With T > 220 MeV assumed for top SPS energy [43, 44], 
this contribution is a background candidate over the entire invariant mass interval 
covered by the data. In general Drell- Yan, open charm decay and plasma radiation 
contributions are smooth, partially closing the "holes" in the hadronic cocktail 
undershoot of the data. This smoothing effect is helped, finally, by consideration 
of modifications concerning the o meson spectral function near T = Te, which 
[133-135] both affects the immediate 9 — e*e~ decay invariant mass region 
(through the fraction of in-medium @ decays vs. the in vacuum decay fraction after 
hadronic freeze-out) and, even more importantly, the contribution of in-medium 
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z^z annihilation to dileptons. The latter contribution accounts for the most 
obvious deviation between cocktail and data at 0.3 < mj; < 0.7 GeV in Fig. 7.58 
(left panel). 

The right hand panel of Fig. 7.58 shows the results of various theoretical models 
which address the sources of the significant dilepton excess over hadronic (in 
vacuum) and Drell-Yan cocktails, labeled *Rapp-Wambach" [133, 135], “dropping 
mass" (Brown-Rho [134]) and *Kaempfer" [252]. We shall return to these models 
below but note, for now, that the extra yield in central A+A collisions chiefly 
derives from z'*zr^ annihilation via the (in medium modified) o resonance, and 
from modification of the o peak itself. 

With improved statistics and background control, the A+A specific extra dilepton 
yield below M 1.4 GeV/c? can be represented by itself, after cocktail subtraction. 
This has been first accomplished by NA60 [136, 253] and, more recently, also by 
NA45 [254]. We show the former results in Fig. 7.59. The left panel shows the 
di-muon invariant mass spectrum in semi-central Indium-Indium collisions at top 
SPS energy ys = 17.3 GeV, compared to the hadronic cocktail, qualitatively 
in agreement with the left hand panel of Fig. 7.58 but with superior resolution 
and statistics. The cocktail subtraction procedure (see [136] for details) leads to 
an invariant mass spectrum of di-muon excess in In+In, shown in the right side 
panel of Fig. 7.59: an experimental landmark accomplishment. The o vacuum decay 
contribution to the hadronic cocktail has been retained and is shown (thin solid 
line) to be a small fraction of the excess mass spectrum, which exhibits a broad 
distribution (that is almost structureless if the cocktail o is also subtracted out), 
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Fig. 7.59 (Left) Di-muon invariant mass spectrum in semi-central In+In collisions at ./s = 
17.3 GeV, with hadron final state decay cocktail. (Right) Excess mass spectrum after cocktail 
subtraction, confronted with fits from the broadening [133, 135, 255] and the dropping mass [228] 
models. From [136] 
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widening with collision centrality [136]. The best theoretical representation of the 
excess yield again results (like in Fig. 7.58, right panel) from the broadening model 
[133, 135, 255] where the o spectral function is smeared due to various coupling 
mechanisms within the medium via the vector dominance model, prior to hadronic 
freeze-out. In the VDM ọ couples, both, to pion pair annihilation, and to excited 
baryon states like the N* (1520), via their Nara decay branches. 

The above data still imply serious conceptual questions. It is unclear to exactly 
which stage of the dynamical evolution (in the vicinity of T = T,) the excess 
dilepton yield should correspond. As the observations appear to be coupled to in- 
medium o meson “metabolism” we need to identify a period of temporal extension 
above the (in vacuo) o half life (of 1.3 fm/c), safely 2 fm/c. This period should 
be located in the vicinity of hadro-chemical freeze-out. From top SPS to RHIC 
energy, hadro-chemical freeze-out should closely coincide with hadron formation, 
ie. it should occur near the parton-hadron coexistence line, at T = Te. The 
microscopic parton cascade model of reference [85] implements the Webber [121] 
phenomenological, non-perturbative QCD hadronization model (recall Sect. 7.3.3) 
which proposes pre-hadronization clusters of color neutralization (Fig. 7.31) as the 
central hadronization step. In it, so one might speculate, the transition from pQCD to 
non perturbative QCD creates the chiral condensates (qq), spontaneously breaking 
chiral symmetry [256] (see below), and creating hadronic mass. The overall process, 
from pQCD color neutralization to on-shell hadrons, takes about 2.5 fm/c [85, 86] 
at top SPS energy. This could, thus, be the period of excess dilepton yield creation. 
However, the relation of the models employed above [133—135, 255, 256] to this 
primordial spontaneous creation of chiral condensates is still essentially unknown 
[256]. 

Thus, at present, the 1990s paradigm of a direct observation of the chiral phase 
transition in QCD has been lost. The Brown-Rho model [134] predicted the o mass 
to drop to zero at T = Te, occurring as a certain power of the ratio (qg)"*4 / (qq) 
of the chiral condensate in medium and in vacuum which approaches zero at 
the chiral phase transition temperature, then expected to coincidence with the 
deconfinement temperature. This “dropping mass" model is ruled out by the data 
in Fig. 7.58 and 7.59. This is, perhaps, a further manifestation of the fact that the 
deconfined QGP state at T > Te is not a simple pQCD gas of quarks and gluons 
[213]. In fact, lattice calculations [232, 257] find indications of surviving light qq 
pair correlations in the vector channel at T > Te. Thus the two most prominent 
symmetries of the QCD Lagrangian, non abelian gauge invariance (related to 
confinement) and chiral invariance (related to mass) might exhibit different critical 
patterns at T = Te and low baryo-chemical potential. This conjecture is best 
illustrated by the observation that the broad, structureless NA60 excess dilepton 
spectrum of Fig. 7.59 (after cocktail o subtraction) is equally well reproduced by a 
T ~ 160-170 MeV calculation in hadronic (equilibrium) matter [133, 253, 254], 
and by a thermal QGP fireball of qq annihilation at this average temperature [252], 
as illustrated here by the model curve labeled “Kaempfer” in Fig. 7.58 (right panel). 
This observation has invited the concept of parton-hadron duality near T, [258], 
which might be provocatively translated as “the QCD chiral transition properties 
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cannot be unambiguously disentangled from the deconfinement transition effects at 
T." [256]. 

We may be looking at the wrong domain of [T , upg] space: too high ./s and, thus, 
too high T, too low ug. Already at top SPS energy the medium is dominated by the 
deconfinement, not by the chiral QCD transition. After hadronization the medium 
is still at T close to T, but the density drops off within a few fm/c, not allowing for 
an equilibrium mean field state of chirally restored hadrons. It is thus perhaps not 
surprising that the data are seen to be dominated by simple broadening and lack of 
structure: perhaps that is all that happens to hadrons at Te. 

The chiral restoration transition should thus be studied at higher ug and lower 
T such that the dynamics achieves high baryon densities but still merely touches 
the critical (deconfinement) temperature. In fact, at ug. — 1 GeV and T < 
100 MeV the chiral transition should be of first order [259]. Here, in fact, the 
chiral condensate mass plays the role of the order parameter (in analogy to the 
magnetization in a spin system), which approaches zero as T — Te. We might thus 
speculate that, unlike at top SPS to LHC energy (where deconfinement dominates), 
the chiral QCD first order phase transition will dominate the phenomena occurring 
near the hadron-parton borderline, in the vicinity of ./s = 4-6 GeV [142]. This 
requires a new experimental program, with low energy running at the SPS [260], at 
RHIC [261] and at the GSI FAIR project [262]. 


7.7 Fluctuation and Correlation Signals 


Fluctuation and correlation signals in A+A collisions can be evaluated in single 
events, due to the high multiplicity of produced particles. Depending on the physics 
context we may be interested to see either a small, or a large nonstatistical fluc- 
tuation effect. For example in view of the universality of hadronization (Sect. 7.3) 
we would have difficulty with an event by event pion to baryon ratio (essentially 
Ug ! fluctuating by, say, 50%. Conversely, searching for critical fluctuations in the 
vicinity of a predicted critical point of QCD [146, 147] we would be frustrated 
if event-wise (pr), d N/d y (low pr pion) [263] or strange to non-strange ratios 
like K/z [148, 264] would not exhibit any significant fluctuation beyond statistics. 
Overall, event by event fluctuation observables also carry a message concerning 
the robustness of our assumptions about equilibrium attainment. It turns out that 
equilibrium properties are not, merely, central limit consequences of ensemble 
averaging. Rather to the contrary, each A * 200 central collision event at ./s > 
10 GeV appears to offer, to the dynamical evolution of bulk properties, a sufficiently 
complete macroscopic limit. Such that we can view the event-wise bulk dynamics 
as "self analyzing". 
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7.7.1 Elliptic Flow Fluctuation 


That this ascertations is far from trivial is illustrated in Fig. 7.60. It shows [265] 
the primordial transverse energy density projection (the stage basic to all primordial 
observables) at £ ~ 0.2 fm/c, in a central Au+Au collision at ys = 200 GeV, 
exhibiting an extremely clumpy and nonhomogeneous structure, apparently far 
away from equilibrium. 

The imaging of this initial geometrical pattern of the collision volume by a 
hydrodynamic evolution even applies at the level of individual events, such as 
illustrated in Fig. 7.60. The PHOBOS Collaboration has shown [266-268] that an 
event by event analysis of the elliptic flow coefficient v» is possible (see ref. [266] for 
detail), by means of a maximum likelihood method. For initialization they sample 
the seemingly random initial spatial density distribution of single Au+Au collision 
events by the "participant excentricity" of individual Monte Carlo events, 


(o? — 02)? + 402, 
———————— (7.72) 


€ = 
part 2 2 
Oy + Oc 


where oxy = (xy) — (x) (y). The average values of €par turn out to be similar 
to €, from Eq. (7.47), as expected, but the relative fluctuation width o (e)/ (€) part 
turns out to be considerable. It is the point of this investigation [267, 268] to show 
that the observed relative event by event flow fluctuation equals the magnitude of 
the relative excentricity fluctuation. This is shown [268] in Fig. 7.61. The left panel 
demonstrates that the average (v2) obtained vs. Npar from the event-wise analysis 
agrees with the previously published [156] event-averaged PHOBOS data. The right 
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Fig. 7.61 (Left) Event by event elliptic flow analysis by PHOBOS gives an average (v2) that 
agrees with the result of ensemble analysis, in Au+Au at 200 GeV, for charged hadrons. (Right) 
Event-wise relative v2 fluctuation vs. Npart, compared to the event-wise relative fluctuation of the 
participant excentricity, o (€part)/ (€part)- The closed line gives vz variation due to Npart number 
fluctuation. From [268] 


panel shows that the event-wise relative fluctuation of v» is large: it amounts to about 
0.45 and is equal to the relative fluctuation of epar, i.e. 


o (v2)/ (v2) © 0 epar / (epar) - (7.73) 


The initial geometry appears to drive the hydrodynamic evolution of the system, 
not only on average but event-by-event [268], thus providing for an example of the 
self-analyzing property mentioned above. The v2 signal thus emerges as the most 
sensitive and specific diagnostic instrument for the primordial conditions and their 
subsequent evolution: it reveals even the random (Fig. 7.60) initial fluctuations. In 
comparison the analysis with thermal photons is only sensitive to the primordial 
temperature [249], and restricted by very small cross sections and significant 
background from other sources and evolution times. It also does not give viscosity 
information. 


7.7.2 Critical Point: Fluctuations from Diverging 
Susceptibilities 


Recalling the goal defined in the introduction we seek observables that help to 
elaborate points or regions of the QCD phase diagram, Fig. 7.1. We have seen 
several observables that refer to the QCD plasma environment at T > 300 MeV, 
u ~ 0 (elliptic flow, jet attenuation, J/ V suppression, direct photon production), 
which could be first resolved within the primordial time interval t < 1 fm/c 
accessible at RHIC energy. The LHC will extend the reach of such observables 
toward T ~ 600 MeV, xp < 1073, at uB = O. On the other hand, relativistic A+A 
collisions at lower energy permit a focus on the hypothetical QCD parton-hadron 
coexistence line, T = T, (up), with the domain ug — 500 MeV being accessible 
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at the SPS. Characteristic observables are radial flow, hadro-chemical freeze-out, 
and chiral symmetry restoration effects in dilepton vector meson spectra. Focusing 
on this domain, we discuss fluctuations potentially associated with the existence of 
a critical point (8-11, 146, 147]. 

At the end of Sect. 7.6.3 we mentioned the conclusion from chiral symmetry 
restoration models [15, 259] that at high wp the phase transformation occurring at 
Tc (ug) should be a chiral first order phase transition. On the other hand, lattice 
QCD has characterized [16] the phase transformation at ug — 0, to be merely a 
rapid cross-over. Thus, the first order nature of the phase coexistence line in Fig. 7.1 
has to end, with decreasing ug, in a QCD critical point, tentatively located by recent 
lattice QCD calculations [9-11] in the interval ug = 300-500 MeV. The existence 
of such a point in the [7T, ug] plane would imply fluctuations analogous to critical 
opalescence in QED [146, 147, 263]. Beyond this second order phase transition 
point the coexistence line would be the site of a rapid cross-over [16]. This overall 
theoretical proposal places potential observations related to the critical point itself, 
and/or to the onset of first order phase transition conditions at higher ug, within the 
domain of the lower SPS energies, /s < 10 GeV. Note that, at such low energies, 
the initialization of thermal equilibrium conditions should occur in the vicinity of 
Tc, unlike at RHIC and LHC, and that the central fireball spends considerable time 
near the coexistence line, at 300 < ug < 500 MeV. 

To analyze potential observable effects of a critical point, we recall briefly the 
procedure in finite upg lattice theory that led to its discovery. One method to compute 
thermodynamic functions at ug > 0 from the grand canonical partition function 
Z(V, T, uq) at uq = 0 is to employ a Taylor expansion with respect to the chemical 
quark potential [10, 11, 270], defined by the derivatives of Z at u = 0. Of particular 
interest is the quark number density susceptibility, 


52 
w=" (Sa s) Uu 


which can also be written as 


=T? E A EE D ny + nq 
mad non Y irn) T3 (7.75) 


with xq. = (Xu + xa)/2 and quark number densities ny, na. We see that the 
susceptibility refers to the quark number density fluctuation. The lattice result 
[10, 270] is shown in Fig. 7.62, a calculation with two dynamical flavors assuming 
T. = 150 MeV and three choices of chemical quark potential, uq = 0,75 and 
150 MeV, respectively, corresponding to ug = 3 uq = 0, 225 and 450 MeV. These 
choices correspond to LHC/RHIC energy, top SPS energy and ys & 6.5 GeV, 
respectively. At ug = 0 one sees a typical smooth cross-over transition at T = T; 
whereas a steep maximum of susceptibility occurs with ug = 450 MeV. This 
suggests the presence of a critical point in the (T, ug) plane [270] in the vicinity of 
(150 MeV, 450 MeV). For final confirmation one would like to see this maximum 
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disappear again, toward 4q > Te, but this is beyond the convergence domain of the 
employed Taylor expansion (see ref. [9] for alternative approaches). 

From Fig. 7.62 we also expect a divergence of the strangeness susceptibility, for 
which no results from a 3 dynamical flavors calculation at finite ug exist to date. 
A lattice calculation at ug = 0, T ~ 1.5 T; suggests [271] that the u, d, s quark 
flavors densities fluctuate uncorrelated (but we do not know whether that is also 
true at ug = pe This could thus be observed in event by event analysis, in 
particular as a fluctuation of the Wroblewski ratio A, = 2(s + s)/(u +u +d + d) 
which is approximated by the event-wise ratio (K* + K )/(t^ + x ). This was 
first measured by NA49 in central collisions of Pb+Pb at top SPS energy; the result 
[272] is shown in Fig. 7.63. The data result from a maximum likelihood analysis 
of track-wise specific ionization in the domain 3.5 < y < 5 slightly forward of 
mid-rapidity. The width odata is almost perfectly reproduced by the mixed event 
reference, such that the difference, 


ayn = Ohta = 02) (1.16) 


amounts to about 3% of ddata only, at ./s = 17.3 GeV. This analysis has more 
recently been extended to all energies available thus far, at the SPS [273] and at 
RHIC [274]. Figure 7.64 shows that ogyn stays constant from top SPS to top RHIC 
energy but exhibits a steep rise toward lower energies that persists down to the 
lowest SPS energy, ./s = 6.2 GeV. Figure 7.33 shows [107] that at this energy 
uB = 450 MeV, thus corresponding to the susceptibility peak in Fig. 7.62. Again, 
as we noted about the peak in Fig. 7.62: if these data indicate a critical point effect in 
the vicinity of ug = 450 MeV the relative fluctuation should decrease again, toward 
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yet higher upg and lower ./s. These data will, hence, be re-measured and extended to 
lower ./s by experiments in preparation [260—262] at CERN, RHIC and GSI-FAIR. 
This will also help to evaluate alternative tentative explanations invoking fluctuating 
canonical suppression [120], or strangeness trapping [275]. Finally, the position of 
the critical point needs to be ascertained by lattice theory. 
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7.7.3 Critical Fluctuation of the Sigma-Field, and Related 
Pionic Observables 


Earlier investigations of critical QCD phenomena that might occur in high energy 
nuclear collisions were based on QCD chiral field theory [276]. The QCD critical 
point is associated with the chiral phase transition in so far as it appears as a remnant 
of a tri-critical point [147] corresponding to the “ideal” chiral limit that would occur 
if m, = mq = 0. Therefore the existence of a second-order critical point, at ug > 0, 
is a fundamental property of QCD with small but non-zero quark masses [277]. The 
magnitude of the quark condensate, which plays the role of an order parameter of 
the spontaneously broken symmetry (generating hadronic mass), has the thermal 
expectation value 


1 
(qd) = 7 5 ("|74 | nyexpC- En/T) (7.77) 


n 


with the partition function of hadronic states E, 


Z =) exp( - E./T). (7.78) 


The low energy behavior of the matrix elements (n | qq | n) can be worked out 
in chiral perturbation theory [277]. At the QCD critical point the order parameter 
fluctuates strongly. Its magnitude (gq) is identified with an isoscalar quantity, the 
so-called o-field. The critical point communicates to the hadronic population via 
the o < xm reaction, generating fluctuating fractions of the direct pion yield 
present near T = T,, which thus gets imprinted with a fluctuation of transverse 
momentum (in the low pr domain) stemming from o mass fluctuation, downward 
toward the critical point. At it the isoscalar field ideally approaches zero mass, in 
order to provide for the long wavelength mode required by the divergence of the 
correlation length [147]. 

Note the relatively fragile structure of the argument. In an ideal, stationary 
infinite volume situation the sigma field would really become massless, or at least 
fall below the z^ threshold; thus its coupling to mtr becomes weak, and 
restricted to very small pr. Furthermore, such primary soft pions, already small 
in number, are subject to intense subsequent re-absorption and re-scattering in the 
final hadronic cascade evolution [114]. In fact, experimental investigations of event 
by event pr fluctuations in central A+A collisions, covering the entire J/s domain 
from low SPS to top RHIC energy have not found significant dynamic effects [278— 
281]. Figure 7.65 illustrates the first such measurement by NA49 [278] in central 
Pb+Pb collisions at ./s = 17.3 GeV, at forward rapidity 4 < y < 5.5, showing the 
distribution of event-wise charged particle average transverse momentum, a perfect 
Gaussian. It is very closely approximated by the mixed event distribution, ruling 
out a significant value of odyn from Eq. (7.76). More sensitive measures of changes, 
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event by event, in the parent distribution in transverse momentum space, have been 
developed [282, 283]. NA49 has employed [278, 279] the measure ® (pr), defined 


as [282] 
Iu = 
®(pr) = Zi = Je (7.79) 


where z; = pri — Pr for each particle, with Pyr the overall inclusive average, and for 
each event Z = » y zi is calculated. With the second term the trivial independent 
particle emission fluctuation is subtracted out, i.e. ® vanishes if this is all. Indeed, 
the data of Fig. 7.65 lead to compatible with zero. Furthermore, a recent NA49 
study [279] at mid-rapidity, covers the range from ys = 17.3 to 6.3 GeV (where 
the K /7 ratio fluctuation in Fig. 7.64 exhibits the much-discussed rise, and even 
the ensemble average in Fig. 7.28 shows the unexplained sharp peak) but finds no 
significant signal. 

Alternatively, one can base a signal of dynamical pr fluctuation on the binary 
correlation of particle transverse momenta in a given event, i.e. on the co-variance 
(pri prj) [281, 283] of particles i, j in one event. Of course, the co-variance 
receives contributions from sources beyond our present concern, i.e. Bose-Einstein 
correlation, flow and jets (the jet activity becomes prominent at high Js, and 
will dominate the pr fluctuation signal at the LHC). In co-variance analysis, 
the dynamical pr fluctuation (of whatever origin) is recovered via its effect on 
correlations among the transverse momentum of particles. Such correlations can 
be quantified employing the two-particle pr correlator [281, 284] 


n N(k) N(k) 


Y; Y Y ApriApr; (7.80) 


k=1 i=l j=it+l 


1 
M pairs 


(Apri Apri) = 
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where Mpairs is the total number of track pairs of the events k contained in the entire 
ensemble of n events, N (k) is the number of tracks in event k, and Apr; = pri—Pr 
where Pp is the global ensemble mean pr. The normalized dynamical fluctuation is 
then expressed [281] as 


a (PT)ayn = y (Apri Apr;) / PT- (7.81) 


It is zero for uncorrelated particle emission. 

Figure 7.66 shows the analysis of pr fluctuations based on the pr correlator, 
for central Pb+Au SPS collisions by CERES [280] and for central Au+Au at four 
RHIC energies by STAR [281]. The signal is at the 1% level at all ./s, with no hint 
at critical point phenomena. Its small but finite size could arise from a multitude of 
sources, e.g. Bose-Einstein statistics, Coulomb or flow effects, mini-jet-formation, 
but also from experimental conditions such as two-track resolution limits [284]. 
We note that even if a critical opalescence effect, related to a fluctuating chiral 
condensate at T = Terit, couples to the primordial, low pr pion pair population 
[147, 285], this signal might be dissipated away, and thus “thermalized” to the 
thermal freeze-out scale of about 90-110 MeV, as a pion experiences about 6 
re-scatterings during the hadronic cascade [114]. On the other hand the hadro- 
chemical K/z ratio fluctuation (Fig.7.64) would be preserved throughout the 
cascade (Sect. 7.3). 
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7.7.4 Bose-Einstein-Correlation 


Identical boson pairs exhibit a positive correlation enhancement when A pi; — 0, 
an intensity correlation analogous to the historical Hanbury-Brown and Twiss effect 
(HBT) of two photon interferometry [286] employed in astrophysics. In nucleus- 
nucleus collisions this is an aspect of the symmetry of the N-pion wave function that 
describes pion pairs at the instant of their decoupling from strong interaction. The 
momentum space correlation function C(q, K), q = pi- po, K = i( pi pa) is the 
Fourier transform of the spatial emission function S(x, K) which can be viewed as 
the probability that a meson pair with momentum K is emitted from the space-time 
point x in the freezing-out fireball density distribution [96, 287]. 

The aim of HBT two particle interferometry is to extract from the measured 
correlator C(q, K) as much information about S(x, K) as possible. In addition 
to the traditional HBT focus on geometrical properties of the source distribution 
(“HBT radii”) there occurs information on time and duration of emission, as we have 
already employed in Fig. 7.19, illustrating tf and At in central Pb+Pb collisions at 
top SPS energy [90]. Moreover, even more of dynamical information is accessible 
via the effects of collective flow on the emission function. In this respect, the 
observations make close contact to the hydrodynamic model freeze-out hyper- 
surface (Sect. 7.3.1) and the contained flow profiles. HBT thus helps to visualize the 
end of the collective radial and elliptic flow evolution. This implies that we expect 
to gather evidence for the, perhaps, most consequential property of A+A collisions 
at high ./s, namely the primordially imprinted Hubble expansion. 

We do not engage here in a detailed exposition of HBT formalism and results as 
comprehensive recent reviews are available [96, 287]. Briefly, the measured quantity 
inatat,a x or Kt Kt interferometry is the correlator 


d N/dp? dp? 
C(q, K) ==, +; (7.82) 
d"N/dp, d" N/dp; 


of pair yield normalized to the inclusive yield product. The correlator is related to 
the emission function which is interpreted as the Wigner phase space density of the 
emitting source [288]: 


| f d^x S(x, K) e* |? 


C(q, K) &1 
GST Cd SG, KP 


(7.83) 


Due to the experimental on-shell requirement Ko = v K? +m? the 4-vector 
components of K are constrained on the left hand side. Thus, relation (7.83) cannot 
simply be inverted. 
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To proceed one employs a Gaussian ansatz on either side of Eq. (7.83). The 
experimental data are parametrized by source “radius” parameters R;; (K), 


C(q, K) = 1+ A(K) exp[- 9 R} GOqiq; ] (7.84) 
ij 


employing A(K) essentially as a fudge factor (related nominally to possible 
coherence of emission effects in the source, which would dilute or remove the 
Bose-Einstein statistics effect of C — 2 for q — 0 but have never been seen in 
nuclear collisions). In Eq. (7.84) the sum runs over three of the four components of 
q, due again to the on-shell requirements [287]. For the emission function S(x, K) 
a Gaussian profile is assumed about an "effective source center" x(K), thus 


S(x, K) > Sx(K), K) x G (7.85) 


where G is a Gaussian in coordinates x“(K) relative to the center coordinates 
x" (K). Inserting Eq. (7.85) into (7.83) one finally obtains 


C(q, K) = 1-- A(K) exp [-ququ (£^ ")] (7.86) 


where (x^ X") are the elements of the space-time variance of the correlation func- 
tion, which re-interpret the “radii” R? in Eq. (7.84). Assuming azimuthal symmetry 
(central collisions), cartesian parametrizations of the pair relative momentum q 
coordinates (corresponding to fixation of the space-time variance in Eq. (7.84)) have 
been introduced by Yano, Koonin and Podgoretskii [289], and, alternatively, by Pratt 
[290]. The latter, out-side-longitudinal coordinate system has the “long” direction 
along the beam axis. In the transverse plane, the “out” direction is chosen parallel 
to Kr = (pir + por)/2, the transverse component of the pair momentum K. The 
“side” direction is then orthogonal to the out- and long-direction but, moreover, it 
has the simplest geometrical interpretation (see ref. [287] for detail), to essentially 
reflect the transverse system size [288]. The parameters of Eq. (7.86) are thus 
defined; as an example we quote, from identification of Eq. (7.84) with (7.86), the 
resulting geometrical definition of the “side” radius, 


R2,,(K) = (suo). (7.87) 
Overall, this model of Fourier related correlators, C(q, K) the experimentally 
accessible quantity (see Eq. (7.82)), and S(x, K) the to-be-inferred spatial freeze- 


out fireball configuration, leads to the Gaussian ansatz [287] 


C(q, K) = 1-- X(K) exp- Ro (qoa — Reide(K )4side — 
Riong(K)diong + cross terms] (7.88) 
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which is fitted to the experimental correlation function (Eq. (7.82)). The experiment 
thus determines the variances Rout, Rsideand Rjong. In the so-called “local co-moving 
system" (LCMS), defined as the frame in which pz,ı = —pz,2, 1.€. Plong = 0, we 
obtain in addition to Eq. (7.87) 


Roa (KO? = (GO — Br GO) 
Rig (KO = (£O?) (7.89) 


and finally, for azimuthal symmetry in central collisions with (x?) x (57) we find 


the “duration of emission” parameter illustrated in Fig. 7.19 [90]: 
-2\ A1 2 2 
(7?) x p (Rou — RÀ). (7.90) 


The resulting reduction of the initial 8 coordinates of the meson pair stems, 
in summary, from the on-shell requirement, from azimuthal symmetry and from 
approximate Bjorken invariance in the LCMS system [287, 288]. One has to 
be aware of the latter idealizations. Note that in an expanding source all HBT 
parameters depend on K, the pair mean momentum (see below). 

We make first use of the above parametrization in Fig. 7.67. From the purely 
spatial radii Rae and Rjong one can define a mid-rapidity volume at pionic 
decoupling, V; = (2z)?? R24, Riong which is shown [291] at K = 0.2 GeV 
for central Pb+Pb and Au+Au collisions from AGS [292] via SPS [90, 293] to 
RHIC [294] energy. The upper panel shows the ./s dependence, the lower illustrates 
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the dependence on the charged particle rapidity density dNen/dy which one might 
intuitively expect to be related to the freeze-out volume [295]. We see, firstly, that 
the plot vs. ./s exhibits a non-monotonous pattern at the transition from AGS to 
SPS energies [287, 295], whereas the plot vs. dN /dy rather features a rise toward a 
plateau that ends in a steep increase at RHIC energies. Second, the tenfold increase 
in charged particle rapidity density is reflected in only a doubling of the “volume” 
Vr. 

The latter observation reminds us of the fact that the geometrical parameters 
obtained via the above analysis do not refer to the global source volume if that 
volume undergoes a collective Hubble expansion [96, 287]. A pion pair emitted with 
small relative momentum into the azimuthal direction Kr is likely to stem (only) 
from the fraction of the expanding source that also moves into this direction. This 
coupling between position and momentum in the source becomes more pronounced, 
both, with increasing Kr and increasing sources transverse velocity Br from radial 
expansion. We have seen in Fig.7.24 that the latter increases dramatically with 
A/s, such that the coherence volume Vr comprises a decreasing fraction of the total 
fireball. It should thus rise much more slowly than proportional to the global dN /dy 
[96, 287]. 

A striking experimental confirmation of the Hubble expansion pattern in central 
A+A collisions is shown in Fig. 7.68. The illustrated HBT analysis of NA49 [90] 
at ./s = 17.3 GeV, and of PHOBOS [294] at ./s = 200 GeV, employs the 
alternative parametrization of the correlation function C(q, K) introduced [289] 
by Yano, Koonin and Podgoretskii (YKP). Without describing the detail we note 
that the YKP correlation function contains the “YK velocity" yg describing the 
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source's longitudinal collective motion in each interval of pion pair rapidity, 


1 Ei +E z z 
y =- In (ppe) , (7.91) 
2 Ei + E2 — pa — pz 
Defining the “YKP rapidity” corresponding to yg by 
1 1 + Évk 
Yykp = > In (= + Yem (7.92) 
2 1 — pyk 


leads to the results in Fig. 7.68, indicating a strong correlation of the collective 
longitudinal source rapidity Yyxp with the position of the pion pair in rapidity space. 

We are interested in a possible non-monotonous 4/s dependence of freeze-out 
parameters such as Vr because of recent predictions of the relativistic hydrodynamic 
model that a QCD critical point should act as an attractor of the isentropic expansion 
trajectories in the [7, upg] plane [101, 296]. Figure 7.69 shows such trajectories 
characterized by the ratio of entropy to net baryon number, S/np, which is 
conserved in each hydro-fluid cell throughout the expansion. Note that the S/ng 
ratio is determined from the EOS during the primordial initialization stage in the 
partonic phase; the relationship between S/ng and 4/s is thus not affected by the 
presence or absence of a critical end point (CEP) at T = T, which, however, 
drastically influences the trajectory at later times as is obvious from comparing the 
left and right panels, the latter obtained with a first order phase transition at all 
up but no CEP. In this case, the S/ng = const. trajectories in the partonic phase all 
point to the origin in the [T, 44] plane because uq/T œ In(S/ng) in an ideal parton 
gas; whereas they point oppositely below T = T, because T?/?/o œ In(S/ng) in 
a hadron gas. This organization is dramatically upset for the cases S/n — 100, 
50, and 33 by the assumption of a CEP, tentatively placed here at T — 155 MeV, 
uB = 368 MeV. 
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Fig. 7.69 (Left) Influence of a critical point on hydrodynamic model isentropic expansion 
trajectories characterized by various values of entropy to net baryon number s/ng. (Right) The 
same trajectories without a critical point but a first order transition all along the phase boundary 
[296] 
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A first conclusion from this model is that it should not be essential to fine-tune 
A/s to make the system pass near the CEP because the attractor character would 
dominate over a substantial domain, e.g. at hadro-chemical freeze-out conditions 
250 < ug x 500 MeV in the example of Fig. 7.69. Please note that hadro-chemical 
freeze-out is not treated correctly in this model, to occur at Ty which is 160 < Ty < 
130 MeV from Figs. 7.1 and 7.33. The trajectories shown here below Ty are thus not 
very realistic. The expected pattern could cause the "plateau" behavior of several 
observables at hadronic freeze-out over the range of SPS energies that corresponds 
to the above interval of ug and Ty, e.g. (mr) and T in Figs. 7.21 and 7.22, elliptic 
flow v2 at mid-rapidity in Fig. 7.36, and coherent hadronic freeze-out volume Vf 
from HBT in Fig. 7.67. 

A consistent search for critical point effects implies, at first, a correct treatment 
of the hadronic expansion phase in hydro-models as above [101, 296], properly 
distinguishing chemical composition freeze-out, and eventual "thermal" freeze- 
out at hadronic decoupling. From such a model predictions for the ./s or $/ng 
systematics could be provided for the HBT source parametrization implied by 
Eqs. (7.84)-(7.86). Le. the hydrodynamic model provides the correlator S(x, K) in 
cases with, and without a critical point which, as Fig. 7.69 shows, leads to consider- 
able changes in the system trajectory in the domain near hadronization and onward 
to hadronic thermal freeze-out, at each given S/n or ./s. On the experimental 
side, this trajectory is documented by [T, wp] at hadronic freeze-out from grand 
canonical analysis [140] (Sect. 7.3) which also yields S/ng. Furthermore, as we 
shall show below, HBT in combination with the analysis of pr or mr spectra will 
describe the situation at thermal hadron freeze-out yielding Tf, Br at the surface 
and the “true” transverse radius Rgeom of the source, in addition to the coherence 
volume V; illustrated in Fig. 7.67 (which also documents the present, insufficient 
data situation). Of course, we have to note that the lattice results concerning the 
critical point are not yet final [270], and neither is the hydrodynamic treatment [296] 
concerning the EOS in the vicinity of the CEP. 

We turn to combined analysis of the two final processes of expansive evolution, 
formation of pr spectra and decoupling to Bose-Einstein pair correlation, in order 
to show how the experimental information mentioned above can be gathered. At 
the level of “hydrodynamically inspired" analytic models for the emission function 
S(x, K) several approaches [106, 297] have established the connection, arising from 
radial flow, between hadronic m7 spectra and the Ky dependence of the HBT radii 
Ride, Routand Riong, Eqs. (7.87), (7.92), which fall down steeply with increasing KT 
[291—294]. A combined analysis lifts continuous ambiguities between the thermal 
freeze-out temperature 7; and the radial expansion velocity Br (for which a radial 
profile Br = (r/Rgide) Bo is assumed), that exist in the blast wave model derivation 
of both the pr spectra, Eq. (7.25), and the Ky dependence of Rg. 

This was first demonstrated in a NA49 study of pion correlation in central Pb+Pb 
collisions at ./s = 17.3 GeV [90] that is shown in Fig. 7.70. The ambiguous anti- 
correlation of fit parameters 7; and Be can be somewhat constrained if several 
hadronic species are fit concurrently. This is obvious in Fig. 7.70 from the overlap 
of parametrization regions for negative hadron, and for deuterium mr spectra. An 


424 R. Stock 


Fig. 7.70 Allowed regions of 200 7 
freeze-out temperature vs. d 2n-BE | 
radial expansion velocity for 
central Pb+Pb collisions at 80 = I 
~s = 17.3 GeV and ho 

mid-rapidity, combining 1 
negative hadron and 60 — I 
deuterium spectral data / 
analysis with BE z 
correlation results on KT 
dependence of R, 7 Rs [90] 


T [MeV] 
& 
I 


Fig. 7.71 The surface J 
velocity (here denoted as pọ) 
of radial expansion at 

decoupling in central Au+Au o es 
and Pb+Pb collisions vs. ./s. 


: : o 
From combined analysis of Qoo $ 
NE. " Oo 9 o E 
hadron pr spectra and z^ pair 8 - " o o 
correlation [291, 297] "a 
E 
& 0.6 H 
04 - m AGS (E895) 
6 SPS (NA49) 
PHENIX ( Vs 130 GeV) 


02 [- — A STAR ( Vs— 200 GeV) 


0 I ıl icd] 
10 10? 
vs [GeV] 


orthogonal constraint on p; / Tr results from a blast wave model fit of the HBT Kr 
dependence of RT © Rsiae [298], employed here. We see that the three independent 
lo fit domains pin down 7; and £r rather sharply, to [115 MeV, 0.55]. A relativistic 
correction [297] leads to 7; = 105 MeV, fr = 0.60. The ./s dependence of fr at 
the freeze-out surface, from such an analysis [291, 297] is shown in Fig. 7.71. The 
data again exhibit a plateau at SPS energies, which remains to be understood [299]. 
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In the light of the considerations above, the plateau might turn out to reflect a critical 
point focusing effect. 


7.8 Summary 


We have seen that many physics observables in relativistic nucleus-nucleus colli- 
sions can, at RHIC energy 4/s = 200 GeV, be related to the primordial dynamical 
phase, from initial QCD parton shower formation to local momentum space 
equilibration. The time interval is 0.35 to 0.65 fm/c. This domain can be investigated 
at RHIC due to the short interpenetration time, of 0.15 fm/c. From among the bulk 
hadron production signals, total and midrapidity charged particle rapidity densities, 
d Nen/d y, reflect the primordial parton saturation phenomenon [72]. It leads to an 
unexpectedly slow increase of multiplicity with fireball participant number Npart, 
and with 4/s. This observation signals the onset of nonperturbative QCD, a coherent 
shower multiplication by multigluon coherence [62, 65, 70, 71, 75]. It is expected 
to be even more dominant in LHC Pb+Pb collisions at 5.5 TeV. Furthermore, 
elliptic flow, a collective bulk hadron emission anisotropy, also originates from 
the primordial, nonisotropic spatial density profile of shower-produced partons 
[93, 96]. A hydrodynamic evolution sets in at f < 1 fm/c, implying the existence 
of a primordial equation of state (EOS) for partonic matter in local equilibrium. 
Moreover, the experimental elliptic flow data at RHIC are well described by “ideal 
fluid" hydrodynamics, i.e. by a very small shear viscosity 7 and a small mean free 
path A [94, 95]. 

These observations indicate the existence of a (local) equilibrium state at very 
early times in the bulk parton matter stage of the dynamical evolution. This quark- 
gluon plasma (QGP) state of nonperturbative QCD was predicted by QCD lattice 
theory [11]. In Au+Au collisions at RHIC energy this state appears to be realized 
under the following conditions [61]. 

The energy density € amounts to 6-15 GeV/fm?, far above the parton-hadron 
QCD confinement region at € = 1 GeV/fm?, and at about 55 times the density 
of nuclear ground state matter, oo = 0.14 GeV/fm?. Translating to macroscopic 
units we are dealing with a matter density of about 1.3 - 10? kg/m, the density 
prevailing in the picosecond era of the cosmological evolution. The corresponding 
temperature amounts to T = 300—330 MeV (about 3.6 - 10! K), far above the 
“Hagedorn limit" [38] for the hadronic phase (Ty = 165 MeV). From an analysis 
of the production ratios of the various hadronic species (from pions to Omega 
hyperons) arising from this primordial partonic state at hadronization, the statistical 
hadronization model (SHM) determines its baryo-chemical potential [108], ug = 
20 MeV at RHIC energy. This value indicates that one is already close to the near- 
zero net baryon number conditions, ug œ% 0, prevailing in the corresponding big 
bang evolution, where the density of particles exceeds that of antiparticles by a 
fraction of about 107? only. 
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Overall we thus obtain entries in the QCD phase diagram of Fig. 7.1. RHIC 
creates a parton plasma at about T = 300 MeV and ug = 20 MeV. It turns out 
to behave like an ideal fluid and features an extremely short mean free path A, thus 
inviting a description by relativistic hydrodynamics. The small shear viscosity 7 (or 
the small viscosity to entropy ratio 5/5) are highlighted by the striking observation 
that even the fluctuations of primordial parton density in individual, single events, 
are preserved in the single event variation of elliptic flow [267, 268]. Moreover, the 
observed scaling of elliptic flow with hadron valence quark number [94]—and thus 
not with hadronic mass as in radial flow [103]—confirms the implied origin of the 
elliptic flow signal from the partonic phase. 

At the LHC the phase of early QCD plasma formation is expected to shift to yet 
higher energy density, in the vicinity of T = 600 MeV and ug = 5 MeV. One is thus 
getting nearer to the domain of QCD asymptotic freedom, and might expect a falloff 
of the partonic cross section which is extremely high at RHIC [61], as reflected by 
the small 7/s and A values. 

The observed features of the QCD plasma produced at RHIC energy have 
invited the terminology of a "strongly coupled" quark-gluon plasma (sQGP [165]). 
Further evidence for the strongly coupled, non perturbative nature of the primordial 
partonic state stems from the various observed, strong in-medium attenuation effects 
on initially produced high pr partons. In Au+Au collisions at /s = 200 GeV 
this high medium opacity leads to a universal quenching of the high pr hadron 
yield [175] including, most remarkably, heavy charm quark propagation to D 
mesons [175, 178]. We have shown in Sect. 7.5 that the interior of the collisional 
fireball at midrapidity is almost “black” at £ < 1 fm/c. This is also reflected in a 
strong suppression of the back-to-back correlation of hadrons from primordially 
produced di-jets [207, 208], and in a similarly strong suppression of the J/W 
yield [237] which we have shown in Sect. 7.6 to be ascribed to an in-medium 
dissolution of the primordially produced cc pairs [41] at T about 300 MeV. 

The underlying high medium opacity can be formally expressed [188, 193, 195] 
by an effective parton transport coefficient g (Eqs. (7.58) and (7.62)) which 
quantifies the medium induced transverse momentum squared per unit mean free 
path A. The value of ĝ derived from analysis of the various attenuation phenomena 
turns out to be almost an order of magnitude higher than what was expected from 
former, perturbative QCD based models [196]. Analogously, n/s has turned out 
to be much smaller than the previous perturbative QCD expectation [92]. The two 
quantities may be related [300] via the heuristic expression 


&315C — (7.93) 


with C a to be determined constant; C — 1/3 from [300]. This relation shows that a 
larger value of ĝ implies a small value for the ratio 7/s. The latter has a lower bound 
by the general quantum gauge field theory limit /s > (47)! [171], a value not too 
far from the estimate 7/s = 0.09 + 0.02 derived from relativistic hydrodynamics 
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applied to elliptic flow v2 [94, 170]. As a consequence, ĝ cannot grow beyond a 
certain upper bound that should be established at LHC energy. 

These considerations are an example of the recent intense theoretical search for 
alternative methods of real-time strong coupling calculations, complementary to 
lattice QCD. In this regime, lattice QCD has to date been the prime non-perturbative 
calculational tool. However, understanding collective flow, jet quenching and 
primordial photon radiation requires real time dynamics, on which lattice QCD 
information is at present both scarce and indirect. Complementary methods for 
real-time strong coupling calculations at high temperature are therefore being 
explored. For a class of non-abelian thermal gauge field theories, the conjecture 
of a correspondence between anti-de Sitter space-time theory and conformal field 
theory (the so-called AdS/CFT conjecture) has been shown [301] to present such 
an alternative. It maps nonperturbative problems at strong coupling onto calculable 
problems of classical gravity in a five-dimensional anti-de Sitter (ADSs5) black hole 
space-time theory [302]. In fact, this formalism has been recently applied [212] in a 
calculation of the transport coefficient ĝ that governs in-medium jet attenuation, 
resulting in an effective, expansion time averaged jeg = 5 GeV?/fm at T = 
300 MeV corresponding to top RHIC energy, rather close to the experimental 
estimates (c.f. Figs. 7.41 and 7.48). 

Thus, it does not seem to be too far-fetched to imagine [301] that the quark- 
gluon plasma of QCD, as explored at RHIC, and soon at the LHC (and theoretically 
in lattice QCD), and the thermal plasma of certain supersymmetric conformal gauge 
field theories (for example N = 4 "Super-Yang-Mills" (SYM) theory as employed 
in [212, 301]) share certain fundamental common properties. 

The focus on early time in the dynamical evolution of matter in nucleus-nucleus 
collisions is specific to RHIC energy as the initial interpenetration period of two 
Lorentz contracted mass 200 nuclei amounts to 0.15 fm/c only. The subsequent 
evolution is thus reflected in a multitude of observables. It is, at first, described as 
a one-dimensional Hubble expansion [61], setting the stage for the emergence of 
the medium specific quantities addressed above (gluon saturation, direct photon 
production, hydrodynamic elliptic flow, jet quenching and J/ suppression). 
These observables tend to settle toward their eventually observed patterns at 
t < 1.0-1.5 fm/c, owing to the fact that they are most prominently determined 
under primordial conditions of high temperature and density. For example, photon 
production reflects T^, and the transport coefficient g falls with T? [212]. 

On the contrary, at the energy of former studies at the CERN SPS, 6 < Js < 
20 GeV, such early times stay essentially unresolved as the initial interpenetration 
takes upward of 1.5 fm/c. A fireball system in local (or global) equilibrium 
thus develops toward t = 3 fm/c, at T about 220 MeV, closer to the onset of 
hadronization [85, 86]. Also the baryo-chemical potential is much higher than in 
RHIC collisions, 250 < ug x 450 MeV. However, we thus gain insight into 
the QCD physics of the hadronization, and high ug domain of the phase diagram 
sketched in Fig. 7.1, in the vicinity of the conjectured parton-hadron coexistence 
line of QCD. 
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For reference of such data, e.g. statistical species equilibrium (Sect. 7.3), dilepton 
studies of light vector meson “melting” at the phase boundary (Sect. 7.6.3), and 
hadronic event-by-event fluctuations (Sects. 7.7.2 and 7.7.3), to theoretical QCD 
predictions, a recent progress of lattice QCD [8-10] is of foremost importance. 
The technical limitation of lattice QCD to the singular case of vanishing chemical 
potential, ug = O (which arises from the Fermion determinant in the lattice 
formulation of the grand canonical partition function), has been overcome recently. 
Three different approaches have been outlined, the respective inherent approxima- 
tion schemes touching upon the limits of both the mathematical and conceptual 
framework of lattice theory, and of present day computation power even with multi- 
teraflop machines. First results comprise the prediction of the parton-hadron phase 
boundary line, which interpolates between the well studied limits of the crossover- 
region at ug — 0, T > T, and the high net baryon density, low T region for 
which a first order character of the phase transition has been predicted by chiral 
QCD models [15]. We have illustrated this line in Fig. 7.1, and we have shown in 
Sect. 7.3 that hadronic freeze-out occurs at, or near this line at ys > 17.3 GeV 
(top SPS energy). The coexistence line includes an intermediate (T, ug) domain 
featuring a critical point of QCD at which the first order line at higher upg terminates, 
in a critical domain of (T, ug) in which the transition is of second order. One 
thus expects the nature of the confining hadronization transition—an open QCD 
question—to change from a crossover to a second order, and onward to a first 
order characteristics in a relatively small interval of ug that is accessible to nuclear 
collision dynamics at the relatively modest ./s of about 5 to 15 GeV. This domain 
has as of yet only received a first experimental glance, but the top AGS and low SPS 
energy experiments exhibit phenomena that can be connected to the occurrence of a 
critical point and/or a first order phase transition, notably the “SPS plateau" in (m7), 
the non-monotonous K * /zt * excitation function, and the eventwise fluctuations of 
this ratio (Sects. 7.2.6, 7.3, 7.7.2 and 7.7.3). A renewed effort is underway at RHIC, 
at the CERN SPS and at the future GSI FAIR facility to study hadronization, in- 
medium meson modification induced by the onset of QCD chiral restoration, as 
well as critical fluctuations and flow, in the low ./s domain. 


7.9 Postscript 


Reinhard Stock 


7.9.1 Progress of the Field 


Returning, after a decade, to the topic of “Relativistic Nucleus-Nucleus Collisions 
and the QCD Phase Diagram", one is struck by the impressive progress that has 
transported many of the crucial themes, and physics observables, from initial, 
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qualitative consideration to oftentimes quantitative comprehension. We shall briefly 
revisit some of these topics below, sketching the present state of the art, and 
mentioning some completely new developments (such as the study of light system 
collisions and, more spectacularly, the possible advent of Equation of State(EOS) 
information from neutron star mergers). This will, however, not be a real review but 
merely a narrative of recent progress. 

The development of the field took a decisive turn by the startup of the CERN 
LHC collider which did not only move the energy frontier up to truly asymptotic 
values, 2.76 and 5.02 TeV per projectile nucleon pair in Pb + Pb collisions, 
but also introduced a completely new generation of nuclear collision experiments, 
offering an extended reach of physics observables and, moreover, a vastly increased 
event statistics capability ranging upward well into the 10? domain. On the other 
hand, the “blockbuster” innovations from the late RHIC period, minimally viscous 
hydrodynamics with specific viscosity ņ/s tantalizing close to the fundamental field 
theoretical limit (establishing the QGP as a near-ideal liquid), transport coefficients 
of the QGP from in-medium-jet quenching, statistical production of charmonia and 
the impression that charmed quarks are also thermalized in the deconfined QCD 
phase, the saturation of the QCD hadronization temperature near 160 MeV, just 
to mention the highlights, have remained the cornerstones of LHC physics to a 
large degree. This holds, also, for the notoriously evasive gluon saturation and 
Colour Glass Condensate physics, and to the inconclusive critical point searches. 
In retrospect, one may state that one very central and important element of the 
RHIC progress (handed on to the LHC physics) consisted of a clear separation of 
the collisional initialization time period (the first fm/c interval) from the ensuing 
hydrodynamical evolution and its in-medium effects. This provided for a much 
clearer relationship between data and theory: it is the simple feature that projectile- 
target interpenetration which took several fm/c at SPS energies is now shrunk to 
subfractions of a fm/c thus providing for a sharp, global synchronization of the 
successive eras of collisional dynamics: initialization, flow expansion, hadroniza- 
tion, final hadron/resonance "afterburning". This is essential as one wants to tie the 
formation of the physics observables to a specific stage of the dynamical evolution, 
thus making them clearcut diagnostic tools. 

One last thing. Neither RHIC nor the LHC experiments were built to explore 
phenomena typical of a large baryochemical potential, such as the critical point of 
QCD or the existence of a first order phase transition toward yet lower energies 
and, quite generally, the QCD phase diagram at high ug. However, at second 
thought a vigorous development at RHIC, the Beam Energy Scan(BES) program, 
as well as a re-vitalization of SPS experiments(the NA61/Shine experiment), and 
a concurrent brilliant extension of lattice QCD technique concerning ("critical") 
fluctuation observables, were undertaken. Also, the low energy, high ug domain 
will come under renewed focus with new facilities, FAIR at GSI Darmstadt and 
NICA at JINR Dubna, both under construction. 

In the following we will present brief sketches of the recent and concurrent 
work, theoretical and experimental, concerning a couple of (subjectively selected) 
observables: reaction dynamics decomposed into 4 characteristic, separate stages, 
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initialization, hydrodynamic expansion, hadronization and final hadron/resonance 
"gas" expansion. We shall focus on the two characteristic parameters of the QCD 
plasma medium, the specific viscosity 7/s and the transport coefficient ĝ, as 
revealed by anisotropic flow and hadron/jet suppression, respectively. Then we turn 
to charmonium and bottonium suppression in the QCD medium, paying special 
attention to charmed meson statistical equilibrium hadronization. This leads us to 
hadronization and the information we have on the QCD phase diagram. These topics 
are all in full development since RHIC's first decade of operation, and have been 
introduced in the preceding review article. Then we turn to a couple of more recent 
topics, e.g. small systems analysis, formation of light nuclei and antinuclei, critical 
point searches and the possible role of (now observable) neutron star mergers to 
unravel the EOS of cold hadronic matter. 


7.0.2 Reaction Dynamics 


Whereas, at the SPS energies, a central Pb + Pb collision featured some fraction of 
the collisional system already expanding toward hadronization at the time where 
the last participant nucleons just experienced their first collisions, the various 
evolutional stages are clearly separated from top RHIC energy onward. With 
Lorentz contraction factors in the hundreds to thousands domain, the primordial 
collisional system longitudinal size is less than a nucleon radius such that all 
primordial interactions occur “at once" so that after about 0.5 fm/c the initialization 
processes can be expected to settle down toward local equilibrium. This creates a 
primordial fireball that still features a highly clumpy energy density distribution. 
This distribution fluctuates from event to event due to a number of influences 
stemming from remaining impact parameter fluctuation, instantaneous density fluc- 
tuations within the average projectile-target Woods Saxon nucleon position/density 
profile (which are thus getting *photographed"), and by the presence or absence 
of colour saturation processes [303]. The latter can go along with the formation 
of a Colour Glass Condensate State [304] which decays to a parton system at the 
end of the initialization period, thus creating a particular topology of the primordial 
energy density distribution [305]. A model calculation of a zero impact parameter 
Au 4- Au collision transverse energy density distribution was already shown in 
Fig. 7.60, exhibiting pronounced clumpiness. This distribution is then, approx- 
imately, translated into the energy-momentum tensor that starts the next stage: 
hydrodynamic expansion. The most remarkable feature of this evolution (see below) 
is the occurrence of a nearly ideal hydro-flow dynamics which approximatively 
preserves information, the better the lower dissipative processes like shear viscosity 
turn out to be. Thus, most remarkably, the eventwise fluctuating primordial fireball 
profile becomes measurable after flow decoupling, modulo the strength of shear 
viscosity. Two of the objects of desire in this physics [306]! 

The hydrodynamic evolution stage is thus, theoretically, well separable from the 
initialization period. The transition is, in itself, of course a matter of theoretical 
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model building. We mention here, in particular, investigations of the AdS/CFT 
correspondence method [307] with respect to primordial equilibration [308]. Col- 
lective flow then transports the density distribution toward decoupling; this stage 
is analyzed under the general idea to follow the Fourier components e, of the 
primordial energy density distribution eccentricity as they get delivered to obser- 
vation in the form of spatial flow anisotropics quantified [309] by their Fourier 
harmonics decomposition coefficients v, (see ref. [310] and bibliography therein). 
Strictly speaking, the hydro-phase does not directly "deliver" the observables of 
viscous, anisotropic flow to observation. This phase ends at hadronization because 
the mean free path in the ensuing hadron-resonance(HR) expansion stage is too long 
for hydrodynamics to apply. One thus introduces a transition from hydro to HR gas 
by means of a hadronization model that translates hydro matter flow into hadron- 
resonance propagation (by the Cooper-Frye formalism [311]). The latter is then 
described by a hadron transport model acting as an “afterburner” [312, 313]. Such 
so-called hybrid models thus consist of three stages that incorporate stage specific 
dynamical models for initialization, hydro flow and hadron-resonance afterburning. 
This allows for a very wide variety of theoretical choices concerning the overall 
dynamics. Add to this that the experimental data from RHIC and LHC can be 
analyzed with various techniques, delivering the flow coefficients v, (reaction plane 
method, cumulant method etc.). Thus an almost excessive wealth of data to model 
comparisons have been undertaken, with the highest focus on elliptical flow v». 
Data exist for up to ve. In general vj—directed flow—and v2 are predominantly 
related to impact geometry variations of mass 200 collisions, creating anisotropic 
primordial energy density gradients. Whereas vs, in particular, appears to be 
more dependent on the primordial energy density profile and its even by event 
fluctuations, potentially related to Colour Glass Condensate(CGC) formation [314]. 
Very briefly summarizing the results one observes an anticorrelation between the 
initialization-, and the hydro flow-effects. The specific shear viscosity n/s (that 
quantifies the speed with which the system approaches equilibrium) is tantalizingly 
close to the Kovtun, Son and Stariets [315] minimum of 1/427, throughout, but 
its deduced value at RHIC goes up with changing the initialization model from a 
"trivial" Glauber trajectory choice, to a CGC model, as is illustrated in Fig. 7.72 with 
RHIC STAR data [314] for v2 vs. theory [315]. Employing the Glauber initialization 
one deduces 5/s to be near 0.08 (the KSS limit) whereas the CGC model requires 
twice the shear velocity, about 0.16. The Glauber initial fireball is wider, spatially, 
than the one from the CGC model. That ambiguity has not finally been settled yet, 
it persists at the LHC, where the specific viscosity is slightly larger. The overall 
present result, within the CGC type of hybrid dynamical models, is given by Eskola 
and coworkers [316] and illustrated in Fig. 7.73. Similar results for n/s have been 
given by other theory groups [313, 317]. In summary, an unambiguous conclusion 
about the QGP state as a near ideal liquid has been arrived at, whereas the existence 
of the CGC still remains to be finally established. 
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Fig. 7.72 STAR v» data [314] confronted with viscous hydrodynamics [315] for various choices 
of n/s, employing a Glauber- and a Colour Glass type initialization 
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Fig. 7.73 Energy dependence of 5/s from a simultaneous fit of RHIC and LHC data [316], with 
CGC initialization 


7.9.3 Energy Loss in a QCD Medium: Hadron Suppression 
and Jet Quenching 


The hydrodynamic model of the plasma evolution employs quantities that could 
be derived from QCD, for example the equation of state and the specific shear 
viscosity n/s. A different aspect of the QGP is seen by an individual hard parton, 
or a jet, traversing it. In general, the overall medium effect will be energy loss and 
momentum broadening, as well as a re-appearance of the lost energy in the form 
of soft emission. A major obstacle in the interpretation of single parton(seen as 
hard hadrons) and jet attenuation is the medium expansion, i.e. one needs, not a 
hydrodynamic but a genuinely microscopic QCD transport model. Then the key 
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ingredient in a theoretical description will be the parton transport parameter ĝ that 
we introduced in Sects. 7.5.2 and 7.5.3. We recall that it represents the average 
transverse momentum broadening square per unit pathlength, ( p2)/ à (cf. [359- 
362, 366]). It is proportional to the local gluon number density. Recall further that 
the average in-medium energy loss is proportional to g [363]. This parameter can 
be systematically deduced from identified hadron pr spectra in A + A collisions as 
compared to the same spectra in minimum bias p + p at similar energy, resulting in 
the nuclear modification factor RAA as defined in Sect. 7.5.1. For jets the standard 
method is to estimate the jet energy loss by comparing the leading hadron (or fully 
reconstructed jet) energy on the trigger side with that of the away-side jet, where 
geometry has made sure that the trajectory traversed the QGP medium. Note, by 
the way, that ĝ is inversely proportional to 7/s from Eq. (7.93) (from old article!). 
Thus if the latter is in a sense "near-minimal" the transport coefficient has to be 
near-maximal. So the medium opaqueness might completely wipe out the jet at 
lower energies, as was indeed observed at RHIC (see Figs. 7.50 and 7.53): the 
black interior situation. At the LHC jet energies are always sufficient to observe 
both jet sides. We illustrate that in Fig. 7.74 [318] where we see very significant 
suppression in the “low” energy domain of recent LHC Pb + Pb data, gradually 
weakening toward high pr. This indicates that the energy loss is not growing in 
proportion to pr. Corresponding results for the p + Pb jet production at the LHC 
indicates essentially zero medium effect, asserting that the suppression in Pb + Pb 
is indeed a final state, QGP effect. We mentioned above that all data concerning 
heavy flavour charm quark attenuation point to a behaviour similar to that of the 
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Fig. 7.74 Nuclear Modification Factors for jet production, obtained by the ALICE, CMS and 
ATLAS LHC collaborations, for p + Pb and Pb + Pb collisions[318] 
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Fig. 7.75 Raa factors for D meson production in p+ Pb and Pb + Pb, by ALICE at LHC [319] 


light quarks. This is illustrated in Fig. 7.75 by ALICE data [319] for D meson 
production in p + Pb and Pb + Pb, central Pb + Pb collisions again showing 
drastic suppression whereas this is absent in the p + Pb collision. We should also 
mention here that the charmed hadrons follow the flow v, patterns observed for 
light quark hadrons. In the end, all A + A data, both about the shear viscosity, near 
minimal, and the transport coefficient, have been shown to be semi-quantitatively 
consistent [320]. Figure 7.76 shows the result of current state of the art theory; quite 
remarkably, the theory groups have followed the example of the experimentalists, 
forming collaborations. Here we show the transport coefficient results from the 
JET Collaboration [320], extracted from fits to the combined data from RHIC and 
LHC, as of 2014. Indeed, the transport coefficient falls with temperature, consistent 
with the inversely proportional specific shear velocity rising with T. Note that the 
temperature scale reflects the energy density in the center of the fireball at an 
initial time of 0.6 fm/c, that we have ascribed above to the end of the primordial 
initialization period. This would be the medium in which the hard partons born 
by perturbative QCD interactions start embedding. The results of many theory 
collaborations are indicated in this Figure (see ref. [320]). The JET collaboration 
employs perturbative QCD technique to generate the jets in the primordial A + A 
environment, then follows them through the co-travelling non-perturbative QCD 
plasma medium. This situation is characteristic of all “hard probes" studies: one 
has to combine the short range pQCD scale with the long range scale inherent in 
the structure(s) of the plasma QCD medium: the leading parton does not interact 
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Fig. 7.76 Temperature dependence of the jet transport coefficient 4// T?, extracted with different 
approaches for jet quenching at an initial jet energy of 10 GeV, in central A+A collisions (see 
ref. [320]) 


with a free gas of partons! The diagnostic of this state is the goal of all had 
probes physics. The present estimate [320] of the transport coefficient amounts to 
about 1.5 GeV?/fm for a 10 GeV quark jet. The lost energy gets radiated by soft 
gluon emission from the plasma and is indeed recovered experimentally in hadrons 
populating the vicinity of the jet cone. The question of the elementary degrees of 
freedom in a high T QCD plasma is coming within reach by such studies. 


7.9.4 Charmonium 


The J/ V signal entered relativistic heavy ion physics as the “Holy Grail” observable 
of QGP formation by the work of Matsui and Satz [321], at the time the CERN SPS 
A + A program took shape, in the late 1980s. From among all observables predicted 
then to provide evidence of QCD deconfinement and QGP formation, the proposed 
in-medium colour Debye-screening mechanism preventing the primordially pro- 
duced cc pair to hadronize as a J/ V hidden charm meson was presenting a direct 
link between deconfined partons and the suppressed cross section of observed J/ V, 
which was relatively well known from elementary p + p and et + e^ collisions. 
We have reported in Sect. 7.6.1 on the success of the charmonium suppression 
idea at the SPS (Fig. 7.54) and at RHIC (Fig. 7.55). However, we recall that it 
was predicted early on that if free charm and anticharm quarks were thermalized 
in the plasma medium (as has been verified meanwhile) they would hadronize 
statistically forming open charm mesons and charmonia [322] in relative hadro- 
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chemical equilibrium, along with all other hadrons, albeit with a separate fugacity 
factor for c and c because these stem from the early hard collision phase unlike the 
majority of the lighter quarks [323]. We note that this mechanism is oftentimes 
called recombination, erroneously so as the thermal charm quarks do not really 
stem from Debey-screened J/ breakup which covers a mere 1% fraction of 
the total primordial charm-anticharm production. The statistical cc hadronisation 
mechanism and its traces in RHIC data were soon to be fully substantiated [324]. 
At LHC energy we expect a significantly higher number density of thermalized 
c and c quarks as the system enters hadronization and, hence, a higher fraction 
of chemical equilibrium charmonia (for a recent review see ref [325]). Indeed, 
Fig. 7.77 (from [325]) shows that the RAA ratios for J/ drop down far more 
steeply with midrapidity multiplicity (centrality) at RHIC than at the LHC where, 
indeed, not much of the low energy charmonium suppression remains visible, being 
overshadowed here by the statistical production. We might remark that nobody 
nowadays insists on a proof of deconfinement anymore (as was the case in 1986) 
and, furthermore, that the universality of statistical hadron production in A + A 
collisions, of which we see evidence here, is in itself a strong manifestation of 
deconfinement. To this topic we turn our attention next, but remark, in passing, that 
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Fig. 7.77 Multiplicity(centrality) dependence of the Nuclear Modification Factor for J/ Y produc- 
tion at mid-rapidity at top RHIC, and at LHC energy [325] 
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the suppression theme is now turning over to the bottonomium production at the 
LHC [326], visible there because of the much lower thermal bottom quark density. 


7.9.5 Hadronization and the QCD Phase Diagram 


This chapter will have two main topics, first the analysis of hadron production 
multiplicities with the Statistical Hadronization Model (SHM) that incorporates the 
canonical or grandcanonical partition functions of a hadron-resonance gas (HRG) 
model. This will be an update on Sect. 7.3. Second, however, we turn to new 
developments of lattice QCD theory, extension to finite baryochemical potential and 
determination of the hadronization parameters T vs. upg from, first, a determination 
of the overlap between Lattice QCD and the HRG model [327], to reveal the 
"hadronization point" and, second, from fitting new data concerning fluctuation of 
conserved charges to higher order Lattice susceptibilities [328]. Some remarks are 
also necessary with regard to the Lattice conclusion that the parton to hadron phase 
transition is “merely” a cross-over at small ug, top RHIC and LHC energy. 

We have shown in Sect. 7.3.4 that, ideally, the Statistical Hadronization 
Model(SHM) freeze-out curve should reveal the QCD parton-hadron phase 
transformation line in the (T, ug) plane, i.e. the most prominent feature of the QCD 
phase diagram. This follows from the assumption that inelastic reactions (except 
resonance decays) among hadrons cease directly at the instant of hadronization: 
chemical hadron freeze-out determines the hadronization “point” corresponding to 
the chosen A+ A collision energy. Add, for correctness: as long as freeze-out occurs 
from the QCD hadronization transition, i.e. from a QGP. About freeze-out from hot, 
dense hadronic matter in expansion, we do not yet know the final interpretation 
of the hadronic multiplicities. Now, the above ideal picture, absence of final state 
effects from the afterburning phase, may require certain corrections. It has been 
shown [329] that, on the one hand, the bulk hadrons from a relativistic A + A 
collision (i.e. pions and kaons that carry about 95% of the total cm energy output 
at LHC energy) do indeed pass the afterburner stage essentially unchanged. At the 
relatively low SPS energies, this applies also to protons and Lambdas. Whereas 
the antibaryon yields get reduced, away from chemical equilibrium, by annihilation 
processes that occur throughout the hadron/resonance expansion phase. At top 
RHIC and LHC energies, baryons and antibaryons are similarly affected, with the 
exception of the Omega hyperons which suffer little annihilation. These afterburner 
effects were quantified [330] by the microscopic transport model UrQMD [312] 
which is employed in many of the current multi-stage “hybrid” models [313, 316] 
for the afterburner stage. The thus obtained modification factors for each species 
were applied to the grand canonical partition functions of the SHM [330]. We 
show the results in Fig. 7.78, which gives (T, ug) points for AGS, SPS and LHC 
energies. The overall result of the annihilation corrections is seen to be an upward 
shift of the hadronization curve(see also [331]), above the curve resulting from 
the uncorrected SHM procedure which agrees with other SHM work [332]. The 
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Fig. 7.78 Freeze-out points in the (T, upg) plane extracted from hadron multiplicity data at LHC, 
SPS and AGS energies, obtained with the standard Statistical Hadronization Model, and with the 
SHM corrected for baryon-antibaryon annihilation during the afterburner phase [330] 


modified approach also explains the so-called “nonthermal proton to pion ratio 
puzzle” at the LHC [332], much discussed but simply resulting, in this model, 
from proton annihilation going to pions, thus decreasing the p/z ratio; of course a 
non-thermal effect [333]. The pseudo-critical temperature at ug=0 turns out to be 
164 + 5 MeV, as compared to about 155 MeV in the standard approach. This new 
development is still vigorously contested by the ALICE community [325, 332]; it 
appears highly desirable that groups employing a different afterburner model turn 
their attention to the final state effects. 

The reader will have noticed that entries from RHIC are missing in Fig. 7.78. This 
is the result of another tension in the community. By far the most comprehensive set 
of hadron multiplicity data stems from the STAR experiment at RHIC [334] which, 
up until now, has published baryon/antibaryon multiplicities without correction for 
feeddown from weak hyperon decays which are misidentified as primary particles 
(this omission is now disposed of, for subsequent data taking, by the new STAR 
vertex tracking detector [335]). Accidentally, the extra particles from feeddown 
do closely compensate for the losses by annihilation, and a picture of perfect 
equilibrium thus emerges from SHM analysis [334]. Then, apparently, there arises 
no p/z puzzle at top RHIC energy, the two missing corrections cancelling. We are 
at upg about 20 MeV, very close to the value of about 1 MeV encountered at LHC. 
A kink in the phase diagram? Clearly an inconsistency. In ALICE at the LHC, the 
primary vertex is very precisely reconstructed, eliminating secondary decays, and 
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NA49 at the SPS has employed a formidable correction simulation to determine the 
feeddown fractions, of order 50%. Once all this dust settles, one can still expect 
an interesting freeze-out curve [323], which should reflect influences of a critical 
point, perhaps even adjacent to a new, further form of QCD phase transition, from 
quarkyonic [337] to hadronic matter, that interpolates between the partonic and the 
hadronic phases at high ug [323]. This region in the QCD phase diagram will 
receive the necessary attention by NA61(SPS) and by the FAIR and NICA projects. 

We turn to a second, very much elaborated source of information about the 
parton-hadron phase boundary, which was made possible by recent developments 
of Lattice QCD. In a way, this progress was a generalization of the attempt to 
extend Lattice QCD from jzg=0 to finite baryochemical potential [336], by a Taylor 
expansion of the reduced thermodynamic pressure in terms its derivatives with 
respect to uq/T : 
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where derivatives are taken at ug = 0. The derivatives of the reduced pressure, of 
order n on the r.h.side of equ. [304], with respect to a chemical potential, are called 
susceptibilities. In general, fluctuations of conserved quantum numbers are obtained 
as derivatives of the pressure to various chemical potentials JL x / T, where X stands 
for net baryon number B, charge Q, strangeness S or charm C (see [328, 337, 338] 
and references therein). Also one can formulate correlations of net charges X and 
Y by mixed susceptibilities [339] which contain derivatives jointly to wx and py. 
Now, all such susceptibilities could be obtained from recent Lattice calculations, up 
to order six, as well as from the Hadron Resonance Gas (HRG) model and, most 
significantly, some of them can be related to experimentally accessible quantities. 
For example, net baryon number fluctuations (one can approximate them by net 
proton number fluctuations) are expressed by properties of the event by event 
multiplicity distribution, e.g. mean (Mp), variance (a3), skewness (Sp) and kurtosis 
(p): 
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Fig. 7.79 The overlap between Lattice QCD and Hadron Resonance Gas(HRG) model cal- 
culations of the normalized pressure, energy density and entropy density, as a function of 
temperature [341]. It is seen that the results match over a relatively broad temperature domain, 
characteristic of a cross-over transition, and slightly different for the three quantities 


where the n-th order susceptibilities y are obtained from the corresponding partial 
derivatives of the Lattice or HRG pressure P(T, ux) with respect to the baryon 
chemical potential, and the mean, variance, skewness and curtosis from the data on 
net proton number fluctuations e.g. by the STAR Collaboration at a set of RHIC 
energies [340]. We cannot do justice to this very wide body of recent work here 
but merely present two typical results. Figure 7.79 illustrates the Lattice-HRG 
overlap analysis [341] technique showing the normalized pressure, energy density 
and entropy density as a function of temperature, at ug = 0. The HRG curves 
start seriously departing from the Lattice results above about 170 MeV. Recall 
that the parton-hadron-transition is a relatively broad cross-over at ug=0; so we 
would conclude that the crossover domain ends here. The center of the pseudo- 
critical region might thus be at about 160 MeV, at ug = 0; the authors also show 
the region center and width estimate from other analyses [337]: not really fully 
supported by the present analysis. The most recent results in the comparison of 
Lattice with the STAR kurtosis data are shown in Fig. 7.80 (taken from ref [338] 
which gives a comprehensive review of the topic). It shows the ratio of the two 
susceptibilities xL and 32 , equal to kurtosis times variance, vs. temperature, 
from various state of the art Lattice calculations, and a band about a freeze-out 
temperature of T — 153 MeV indicated by the data. We cannot exhaust the topic 
which is under vigorous development but wish to enumerate points of concern that 
are presently under investigation: 


1. Itis unclear whether various hadronization observables like hadron multiplicities 
and higher order multiplicity fluctuations should refer to an identical freeze- 
out temperature within the relatively broad pseudocritical temperature band. The 
latter might freeze out later (sequential freeze-out). 
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Fig. 7.80 The ratio of the two susceptibilities x? and x2, equal to kurtosis times variance, vs. 
temperature, from various state of the art Lattice calculations [338], and a band about a freeze-out 
temperature of 7 — 153 MeV indicated by the data 


2. Net proton multiplicity is not a conserved quantity but taken here as a proxy for 
net baryon number: an approximation sensitive to the experimental energy and 
acceptance [342, 343]. 

3. Higher order fluctuations and their ratios receive sizeable contributions from the 
fluctuation, event by event, of the participant nucleon number [343]. 

4. Higher order fluctuations can be dampened in the course of the afterburner 
expansion [344]. 

5. The standard Hadron-Resonance Gas model employed in the Lattice-HRG 
overlap study may need amendments due to Van der Waals-Type hadron- 
hadron repulsion [345], or to high lying strange resonances that are not yet 
experimentally known [346]. 


Clearly, these questions need to be addressed before we can finally conclude on 
the QCD phase diagram and the parton-hadron phase boundary. 


7.9.6 New Topics 
7.9.6.1 Proton Induced Collisions 


In defining the nuclear modification factor R44 (Sect. 7.9.3) we have employed, in 
the denominator, the experimental minimum bias p + p momentum distribution at 
the same cm energy as the energy per participant nucleon pair of the A + A data 
forming the numerator. Thus, the p + p cross section is employed as a reference, 
to incorporate the “no QGP physics" situation that is captured in the Glauber 
model which describes the A 4- A collision as an superposition of independent, 
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minimum bias nucleon-nucleon collisions occurring at the microscopic level. Of 
course this cannot be a realistic model but for the hard, partonic collisions occurring 
during the primordial interpenetration we may still employ it as a standard of 
comparison. Nevertheless it was tempting to check whether p + p collisions 
revealed any of the effects ascribed to the QGP medium in A + A collisions, such 
as grand canonical hadrochemical equilibrium at 160 MeV apparent temperature, 
collective hydrodynamic flow in its multifold Fourier harmonics phenomena, jet 
attenuation etc., thus making it useless as a “no new physics" standard reference. 
This experimental program occupied the first half of this decade, both at RHIC 
and LHC. Lacking space for detail, we briefly summarize the main results. Indeed, 
minimum bias p + p collisions turned out to be essentially free of the new 
physics: ok to employ them in Raa. However, onset behaviour of grand canonical 
hadronization [347], of elliptic flow [348], and new modification patterns of jet 
production [349] were observed in high multiplicity selections of p + p data 
at LHC where the midrapidity charged particle density increases from about 8 
in minimum bias mode, up to well beyond 20 in the ALICE study [347], with 
extremes ranging up to about 80. No surprise then: these are geometrically small 
but energy rich systems which lead to equilibrium conditions due to extreme density 
of degrees of freedom. Such that the bulk phenomena, advent of grandcanonical 
hadronization [347] and development of anisotropic hydro flow, exhibit onset 
patterns. Whereas the hard probes which depend on pathlength might well still 
stay unaffected of the medium. No big surprise, but initially the community was 
quite smitten by the surfacing of typical QGP signals in an elementary collision. 
A different, completely new aspect concerning jet production: the selection of 
extreme multiplicity density at midrapidity clearly represents a strong bias as to 
the primordial partonic collision generation, possibly selecting for changes in jet 
observables [349]. An interesting, unexplored aspect of jet production. 

Along with p + p came a wave of p + Au/Pb collision studies. Here we 
would expect that the projectile side of rapidity space shows essentially p + p 
properties whereas the heavy nucleus side shows, firstly, a strong dependence of 
impact parameter and participant nucleon number variation. Second, the resulting 
primordial fireballs should grow in proportion to Nparr, leading to clear collective 
signals in the bulk production observables, emerging with increasing centrality 
(multiplicity), as was indeed observed [347, 350]. However, again the hard probes 
did not show significant attenuation as we have seen in Figs. 7.74 and 7.75 for LHC 
collisions [318, 319]. It is reassuring to confirm that extended path length in hot QGP 
matter is required in order to produce attenuation, whereas cold nuclear spectator 
matter has a very low transport parameter [320]. In general p -- Pb collisions share 
the complicating feature, common to all light-on-heavy collisions, that the effective 
center of mass for soft production moves away from the corresponding p + p cm 
frame that controls hard processes, as Nparı increases. At the LHC this shift goes 
up to two rapidity units for very central collisions, potentially causing more or less 
trivial effects in the attenuation of hard partons traversing the soft-produced QGP 
medium [351]. Also the effective midrapidity position for soft, bulk production is 
now outside the acceptance for ALICE. 
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7.9.6.2 Lambda Polarization and Fireball Vorticity 


Huge global angular momenta are generated in non-central A + A collisions. In fact 
they get converted to the vorticity of a QGP [352] and get transmitted to particle 
polarization at the stage of hadron formation and particlization whence hadrons are 
emitted from the hypersurface of a hydrodynamically expanding fireball (51, 52). 
The global angular momentum can thus lead to the local polarization of hadrons. 
The polarization, in turn, can be measured by A and A hyperon weak decay into a 
proton and a pion which is "self-analyzing" since the proton is emitted preferentially 
along the direction of the Lambda spin in its rest frame. The global polarization (the 
net polarization of the local ones in an event which is aligned in the direction of 
the event plane, i.e. along the direction of the angular momentum of the plasma) 
of ^ and A has recently been measured by the RHIC STAR Collaboration [355] at 
collisional energies below 62.4 GeV. At higher energies, including LHC, the falling 
of the polarization with energy [355] still precludes measurement. The measurement 
determines the event average of sin(®, — Wrp), where , and Ppp are the 
azimuthal angle of the proton momentum in the Lambda rest frame, and that of 
the reaction plane. Its orientation cannot be directly measured but is approximated 
by the event plane determined from the hadronic directed flow; this is accounted for 
by a reaction plane resolution factor [355]. Alternative methods have been proposed 
recently [356]. For an overview see ref. [357]. 

From the data one can estimate [353, 354] the local vorticity in the plasma, the 
result implying that the QCD matter created in such collisions is the most vortical 
fluid known as of yet. Moreover, this new observable can shed a new, independent 
light on the equilibrium properties encountered (or not) in the course of (relativistic 
hydro) expansion toward hadronization [358]. 


7.9.6.3 EOS from Neutron Star Mergers 


The recent observation of gravitational waves from a neutron star merger has, in 
fact, been anticipated with regard to its possible sensitivity to the EOS of dense 
hadronic matter at low temperature and high baryochemical potential. This is a 
topic of nuclear collision research already since the early BEVALAC studies at LBL 
when it was realized that the EOS should be of importance, both, to the dynamics 
of supernova explosions and to the radial structure of neutron stars, also affecting 
their maximum mass with regard to black hole formation. The hope was to derive 
the EOS from the data concerning hydrodynamic directed sidewards expansion, 
and from Kaon production in A + A “compressive collisions" [359] at the low 
BEVALAC energies, in the GeV per nucleon energy range. One of the key problems 
arose from the fact that even at these energies the collisional fireball temperature 
ranges up to about 100 MeV whereas the above astrophysical phenomena require the 
EOS of cold, compressed matter [360]. The newly accessible Neutron star mergers 
promise to open up a window to the low T EOS [361], and to crucial transport 
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parameters [362], once more experimental information becomes available, as can 
be expected. 


7.9.6.4 Production of Light Nuclei in A + A Collisions 


At RHIC and LHC, a large number of light nuclei and antinuclei have been mea- 
sured [325, 363, 364], from deuterium to anti-alpha. This continues a tradition dating 
back to BEVALAC time [365] and SPS [366]. Dating back is also the controversy 
concerning model description or, alternatively stated, the lack of a comprehensive 
understanding [325, 364, 367]. The Statistical Hadronization Model(SHM) gives a 
very good description(hadro-chemical freeze-out) of the LHC yields [325], along 
with all other hadron multiplicities. The coalescence model [365, 367] addresses 
the end of the hadron/resonance expansion stage (kinetic freeze-out), whence the 
produced nucleons, spread out in phase space, overlap, to a certain degree, with 
the internal wave functions of the various clusters. There are difficulties in both 
views if one takes them literally. The clusters cannot have existed at 160 MeV, the 
hadronization temperature. On the other hand, the coalescence cannot proceed on- 
shell because of the cluster binding energies (that are ignored in the model). All this 
points to a deeper flaw in, either the models, or their conventional understanding. 
The role of quantum mechanics seems to be missing, as pointed out long ago by E. 
Remler [368]. Hadronization does not directly produce a decoherent, on-shell state, 
in A + A collisions. If hadronization occurs via the quantum mechanical decay 
of initially produced colour singlet clusters (see Sect. 5.3.3), then Fermis Golden 
Rule refers to the phase space weights AFTER decoherence. And, in fact, in the 
SHM partition functions we employ the set of in-vacuo free masses of the produced 
particles, also for the clusters, with apparent success. One might then speculate 
that the number densities and their ratios are fixed by the entropy at hadronization, 
but we have to wait until after the freeze-outs to have them decohere. This is a 
speculation, of course. 
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Chapter 8 A) 
Beyond the Standard Model gasii 


Eliezer Rabinovici 


8.1 Introduction 


Starting sometime in 2008/2009 one expects to be able to take a glimpse at physics 
at the TeV scale. This will be done through the Large Hadronic Collider (LHC) at 
CERN, Geneva. It will be a result of an unprecedented coordinated international 
scientific effort. This chapter is written in 2007. It is essentially inviting disaster 
to spell out in full detail what the current various theoretical speculations on the 
physics are, as well motivated as they may seem at this time. What I find of more 
value is to elaborate on some of the ideas and the motivations behind them. Some 
may stay with us, some may evolve and some may be discarded as the results of 
the experiments unfold. When the proton antiproton collider was turned on in the 
early eighties of the last century at Cern the theoretical ideas were ready to face 
the experimental results in confidence, a confidence which actually had prevailed. 
The emphasis was on the tremendous experimental challenges that needed to be 
overcome in both the production and the detection of the new particles. As far as 
theory was concerned this was about the physics of the standard model and not about 
the physics beyond it. The latter part was left safely unchallenged. That situation 
started changing when the large electron positron (LEP) collider experiments also at 
Cern were turned on as well the experiments at the Tevatron at Fermilab. Today it is 
with rather little, scientifically based, theoretical confidence that one is anticipating 
the outcome of the experiments. It is less the method and foundations that are tested 
and more the prejudices. It is these which are at the center of this chapter. Some 
claim to detect over the years an oscilatory behavior in the amount of conservatism 
expressed by leaders in physics. The generation in whose life time relativity and 
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quantum mechanics were discovered remained non-conservative throughout their 
life. Some of the latter developed eventually such adventurous ideas as to form 
as a reaction a much more conservative following generation. The conservative 
generation perfected the inherited tools and has uncovered and constructed the 
Standard Model. They themselves were followed by a less conservative generation. 
The new generation was presented with a seemingly complete description of the 
known forces. In order to go outside the severe constraints of the Standard Model 
the new generation has drawn upon some of the more adventurous ideas of the older 
generation as well as created it own ideas. In a way almost all accepted notions were 
challenged. In the past such an attitude has led to major discoveries such as relativity 
and quantum mechanics. In some cases it was carried too far, the discovery of the 
neutrino was initially missed as energy conservation was temporarily given up. 

The standard model is overall a very significant scientific achievement. It is a 
rather concise framework encompassing all the known properties of the known basic 
interactions. It is arguably the most impressive theoretical understanding of a large 
body of experimental information existing. An understanding backed by precise 
predictions all verified by high quality experiments. In this context it may seem 
surprising that one is searching for anything beyond the Standard Model. There are 
however diverse scientific reasons for the search of the beyond. 

In 2007 the scientific community was aware of quite a few gaps in the 
understanding of the particle interactions. One class of observations posed obvious 
pressing problems: 


* There is a large body of evidence that the so called dark matter should be 
composed mostly of different particle(s) than those that serve as the building 
blocks of the standard model. What are they? More recently also what is called a 
dark energy was needed to explain the data. Its possible origin(s) is under active 
study. 

* A standard model for cosmology is forming and it includes in most cases 
versions of inflation. Such models seem to require the existence of a new heavy 
particle(s). What are they? On a more speculative note, models require more 
detailed understanding of how physical systems behave in big bang/crunch like 
circumstances and of how and if universes may form. 


Another set of observations could be either defined as posing problems requiring 
an explanation or as pointing to new directions only after being combined with a 
certain amount of theoretical prejudice (7P). In the past some major advances were 
driven by such combinations. 


* Three known interactions, the colored, the weak and the electromagnetic inter- 
actions all obey the dictum of quantum mechanics and are all well described by 
gauge theories but have otherwise very different properties and strengths at the 
energy scales probed till 2007. A TP, a strong and deeply ingrained one, suggests 
that all interactions, those known and those yet to be discovered should unify 
at a certain higher energy scale. This is the idea of a Grand Unified Theories 
(GUTS). Eventually it was found that to realize a particular aspect of this, the 
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convergence of the all the couplings to a single value at an appropriate energy 
scale, there should be physics beyond the standard model. In Super Symmetric 
(SUSY) systems this indeed may occur. In fact the unification is achieved at a 
distance scale rather close to the Planck scale, the scale at which gravity becomes 
strong and its quantum effects, if there, should become noted. Thus the fourth 
force Gravity is naturally added to the unification scheme. An older realization 
of this TP was that all known interactions are but the low energy descriptions of a 
system containing only the gravitational force and residing in a higher number of 
space-time dimensions. This had led to Kaluza Klein theories and their variants. 
This idea has been revisited with the advent of SUSY and string theory. 

* The observation of what is called dark energy and its very possible confirmation 
of a very small, but not vanishing, cosmological constant is viewed by many 
as a major problem lacking an explanation. The 7P behind this is that any 
fundamental quantitative property of nature should be explained and not fine 
tuned. The absence of any significant amount of CP violation in the strong 
interactions is another such problem. The discovery of a new particle, the Axion, 
could be a signal of the solution of the latter one. The discovery of another 
particle, the Dilaton, could indicate a resolution of the former. This issue has 
led also to the reexamination of the so called enthropic principle. 

* Many predictions of Classical General Relativity were confirmed experimentally 
over the years. The TP in 2007 is that there should be a quantum theory of gravity. 
Such a four dimensional quantum theory of gravity is not well defined within 
only field theory. This has been a driving force in the study of the properties of a 
theory were the basic constituents of nature are not particles but extended objects, 
including strings. 

* A very successful framework to explain the basic interactions in particle physics 
is the so called Wilsonian one. It is very powerful when the laws of physics 
are such that different largely separated scales are essentially decoupled from 
each other. The physics beneath any energy scale (cutoff) is well described by 
operators whose scaling dimensions, in d spacetime dimensions, is not much 
larger than the same d. In many cases there is only a finite number of such 
operators, i.e. only a finite amount of terms in the Lagrangians describing the 
Physics below the energy scale. This TP has been tested successfully time and 
again but may eventually be falsified, perhaps in a theory of gravity. But given 
the validity of this method physical quantities should be only slightly dependent 
on the cutoff scale, and thus on the unknown physics extending beyond it. 
Generically the mass of scalar particles is strongly dependent on the cutoff. In 
particular instead of the scale of such masses being set by the weak interactions 
scale they will depend strongly on the cutoff. The only known, in 2007, physical 
scale beyond that of the weak interactions is of the order of the Planck scale. The 
discovery of Higgs particles whose masses is in the TeV range or lighter will thus 
require a large amount of fine tuning. The 7P does not accept that. This is one 
manifestation of the so called hierarchy problem. This had led to ideas such as 
technicolor and supersymmetry as properties of beyond the standard model. 
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It seems very likely in 2007 that eventually the neutrino masses will be added 
without caveats to the pantheon of particles, the particle data tables. Some TPs 
point to new physics at a rather high mass scale as the origin of small neutrino 
masses. 


It is interesting that very few of the theoretical ideas and visions used to the 


address the ample set of problems mentioned above were originally created directly 
for that purpose. They were more tools whose applications were found only well 
after they were formed. 


The theoretical structures discovered and created where an outcome of an urge 


to question, generalize and unify almost anything. 


In addition to the attempts to unify all symmetries and considering extra 
dimensions it was suggested that the topology of the extra dimensions may 
determine the low mass particle spectrum. These extra dimensions were assumed 
for year to be small but it turned out they could also be large, leading to theories in 
which large extra dimensions play a key role. A variant of this idea is that there 
exists a hidden sector where many desired things occur, SUSY is broken, the 
cosmological constant is (nearly) cancelled to name some. These effects are then 
communicated to the Standard Model particles by messengers. In many cases the 
weak gravitational force is designated that role. 

A natural direction of generalization is to question the point particle nature of 
the basic constituents of nature. Is there a consistent theory of elementary higher 
dimensional objects such as strings or membranes, this direction of research is 
developed under the name of string theory. It has led to much more satisfactory 
theory of gravity. It has success, faces many challenges not least of which is the 
present lack of experimental evidence and has a touch of magic. This magic has 
had already a significant impact on Mathematical Physics. 

The never satiated desire for simplicity may suggest to remove even the concept 
of a point particle which propagates in space time and for that matter to remove 
even the concept of space time. The idea is that space time is but an emergent, 
long distance, phenomena. The search for “true” underlying picture is still on. 
The quest for knowledge includes a succession of Copernican like revolutions. 
After each such step the researcher finds herself even further removed from the 
Center. A possible prospect for such a revolution is that our universe is but one 
of many universes. The idea emerges in quite a few contexts not necessarily 
unrelated. These include the many world interpretation of quantum mechanics, 
third quantization in quantum gravity, the mutliverse in models for inflation and 
in string theory. In string theory the idea has been refined in the brane world 
picture, our universe reflects an underlying structure in which different particles 
reside on different multi dimensional subspaces. 

In the process of quantization it was assumed that the space coordinates 
commute with each other as do their conjugate momenta. Mathematically one 
may construct noncommutative manifolds, manifolds whose coordinates do not 
commute. This had led to the study of physics on such manifolds. This an the 
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idea that the underlying theory of nature depends only on topology have both 
been extensively studied but at this stage still more in the realm of mathematics. 


The various theoretical ideas mentioned above were extensively studied in the 
second part of the twentieth century. The respective developments are documented 
in many books covering thousands of pages. In this review we have made the 
following choices. The emphasis is on various aspects of SUSY. There has been 
significant progress in understanding the dynamics of systems which have in some 
form or another supersymmetric features. These systems have been researched 
in the weak coupling and the strong coupling regimes, perturbatively and non- 
perturbatively. The properties uncovered are remarkable in some cases. We also 
review, more briefly, the attempts to lay out a framework suitable for extracting 
supersymmetric signatures of nature. This rich area of research will better receive 
its due rewards after the accumulation of actual experimental data. The more 
"senior" areas of research such as examining the possibility of the existence of 
extra dimensions, grand unified theories and superstring theory will be reviewed 
in a much more descriptive manner. This choice was influenced by the possibility 
that the LHC will shed some concrete light on what is beyond the standard model. 


8.2 Super Symmetry [1] 


Super Symmetry (SUSY) embodies several forms of unification and generalization. 
It joins space time and internal symmetries and it generalizes the meaning of 
space time by adding fermionic components to the canonical space time bosonic 
coordinates. An original stated motivation for SUSY in field theory was to have a 
symmetry which was able to relate the self couplings of bosons, the self coupling 
of fermions and the Yukawa couplings of fermions to bosons. Eventually SUSY 
while indeed unifying such couplings has given rise to a multitude of possible non- 
equivalent ground states. Such a degeneracy of ground states was of a magnitude 
unknown before. In string theory, SUSY stabilized superstring theories by removing 
the tachyions which plagued the bosonic string theories. Over the years the 
motivations have varied and evolved. SUSY was called upon to emeliorate the so 
called hierarchy problem which will be reviewed below and it was perceived as 
an omen that in a SUSY theory one was able to arrange that the colored, weak 
and electromagnetic couplings unify at a single high energy scale. That scale not 
far from the Planck scale. In addition it was discovered that this enriched structure 
allows to obtain exact results in situations were only approximations were available 
before. In particular for four and higher dimensional systems. In its presence a very 
rich dynamics has been unraveled. 
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8.2.1 Elementary Particles in SUSY Models: Algebraic 
Structure 


Many successes resulted from the application of symmetry principles to systems 
such as atoms, nuclei and elementary particles. They have been obtained even in 
the face of a rather incomplete understanding of the dynamics of these systems. 
Consequentially evermore higher symmetries were searched for, and in particular 
unifying ones. Such was the search for a symmetry that would contain in a 
non-trivial way both the Poincar’e space time symmetry and internal symmetries 
such as Isospin and flavor SU(3). It was shown that that was impossible, only 
a trivial product symmetry is allowed. Theorems are proved under assumptions, 
time and again new important directions emerge once significant loop holes in the 
assumptions are uncovered. Such was the case for obtaining consistently massive 
spin one particles and such was the case here. Allowing the algebra of the symmetry 
generators to be graded, i.e. to include both commutators and anti commutators a 
new structure containing both Poincar’e and internal symmetries was discovered. 

A simple version of the SUSY algebra is given by the following anticommutation 
relations which obey the following commutation relations: 


{Qu 4] = 20% Pu, {Qu 05] = [Oi Dp} = 9. (8.1) 


Where the Q, are fermionic generators of supersymmetry. P, are the generators 
of space-time translations and the o matrices are the Pauli matrices. 


[P5 Qa TL Pu, Qa =i Pu, Po] = 0 (8.2) 


This is called the N = 1 supersymmetry algebra. 
It can be generalized to include a higher number of supersymmetries. For exam- 
ple in four space-time dimensions there are also N = 2 and N = 4 supersymmetries: 


[0i Qi] = 25 o" P, + ugUij + (roug (8.3) 


i and j run over the number of supersymmetries, U and V are the central charges 
ie. they commute with all other charges, (they are antisymmetric in ij). When 
they do not vanish they are associated with what are called BPS states such as 
monopoles. The d = 4 realisations have as, u,v = 0,1,2,3 the space-time indices. In 
four dimensions one has two component Weyl Fermions. Those with o or £ indices 
transform under the (0, 1) representation of the Lorentz group; and those with dotted 
indices, & or B transform under the (5,0) representation. 

The possible particle content of supersymmetric (SUSY) theories is determined 
by the SUSY algebra. 

Consider first the massless representations of N = 1 supersymmetry. 
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The simplest is the called the chiral multiplet. It contains two real scalars and one 
Weyl Fermion: 


1 1 


In the above table, first are written the helicities; then the associated component 
fields, « denotes a complex scalar and y; a Weyl Fermion; and finally are the number 
of physical degrees of freedom carried by the Bosons and Fermions. The massless 
multiplet containing a spin one boson and a spin one half Fermion is called the 
vector multiplet. Its content in the case of N — 1 is: 


11 
(2-551) (àa, An) 2:2) (8.5) 


A is a Weyl Fermion and A, is a vector field. 
For N = 2 supersymmetry, there is a massless vector multiplet: 


tl l 
(-1 EM Pi) (9.9) + (har An) (4.4) (8.6) 
2 2 


(p1, Vi) + (G2, V2) — (4.4). (8.7) 


t3| —t3| — 


For Massive multiplets, in N — 1, there is again the chiral multiplet which is 
the same as the massless multiplet but with now massive fields. The massive vector 
multiplet becomes: 

1 
—-5 0 
2 


Where A is a real scalar field. The massive vector multiplet has a different field 
content than the massless vector multiplet because a massive vector field has an 
additional physical degree of freedom. One sees that the massive vector multiplet 
is composed out of a massless chiral plus massless vector multiplet. This can occur 
dynamically; massive vector multiplets may appear by a supersymmetric analogue 


t3| —t3l — 


i) (h, Was àa, An) (4,4) (8.8) 
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of the Higgs mechanism. With N — 4 supersymmetry, the massless vector multiplet 
is: 


a Os, 9l, Au) (8, 8) (8.9) 


| 
eS 
top top ee - 2] 7 
ocoooooco 
NIINI =N= = 


where J = 1..6, a = 1..4. 

These are the unitary representations of the Super Symmetry algebra whose par- 
ticle content allows them to participate in renormalizable interactions. Any higher 
supersymmetry in four dimensions would have to involve non-renormalizable terms. 
Mostly for particles with spin higher than one. 


8.2.2 Supersymmetric Lagrangians 


The task of writing down explicit supersymmetric Lagrangians was quite laborious. 
Originally all the interaction terms had to be written down explicitly. In some 
cases this had turned out to be much simpler by the introduction what are called 
superfields. These will be described below and make use of the anticommuting 
Grassman variables suitable to describe fermions. This is called the Super Space 
notation. In the spirit of generalization, the mathematical book keeping device has 
been elevated by some to a generalzation of regular space whose coordinates are 
denoted by communting numbers to a superspace in which some variables are 
Grassman variables. This superspace has its own geometircal properties and it was 
suggested to give it also a life of its own. It is not clear yet how fundamental 
the superspace description is but adopting this notation leads to considerable 
simplification. We note here that another generalization of space has been suggested. 
In regular space the coordinates commute also quantum mechanically, it was 
suggested to explore the situation that space coordinates not commute in quantizing 
the theory. This has experimental consequences, as of now this suggestion has no 
experimental backing. 


8.2.2.1 Superspace, Chiral Fields and Lagrangians for Spin Zero 
and One-half Particles 


Returning to superspace, spacetime can be extended to include Grassmann spinor 
coordinates, 05,04. Superfields are functions of the superspace coordinates. Con- 
structing a Lagrangian out of special types of superfields provides a useful way 
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to construct explicitly supersymmetric Lagrangians. The integration formulas for 
Grassmann variables are: 


ð 
[ = = 1, [ 4-9 (8.10) 
Which results also in: 
= 3L 
| esee = [ve 2 (8.11) 
00102 


The supercharges can be realized in superspace by generators of supertransla- 
tions: 


a = a 
=a = ict 0*9, Ox = -zt i0" o! Ay. (8.12) 
a a 


To define the concept of (anti) chiral fields one defines a supercovariant 
derivative: 


ə =u = 0 
Da = — + iol 0 94, Da =—-—— — 10% oN, ay. 8.13 
a 86, ^ Toà u à 98; IU CoG Ou (8.13) 


A superfield ® is called "chiral" if: 


Dy - 0. (8.14) 


Anti-chiral fields are defined by reversing the role of the 0 and 0. 
One introduces the variable, 


y" = x" 4 igo" 0 (8.15) 


in terms of which the expansion of a chiral field is, 


$(y) = AQ) + V20W(y) + 90 F(y) (8.16) 


The Taylor expansion terminates after just a few terms because of the anticom- 
muting property of the Grassmann coordinates. As a function of the coordinate x 
the expansion may be written as follows: 


P(x) = AQ)-i0o"8 Ə A(x) + 10690 x A(x) + 20r (x) 


= 7500 9, (x)o "8 + 60 F(x) 


(8.17) 
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The key point is that 
L= [ve P(x) (8.18) 


is a invariant under supersymmetric transformations (up to a total derivative). 
After the integration some terms will disappear from the expansion of ®(x) 
leaving only: 


D(x) = A(x) + V20W(x) + 90 F(x) (8.19) 


A(x) will be associated with a complex Boson; y(x) will be associated with a 
Weyl Fermion and F(x) acts as an auxiliary field that carries no physical degrees 
of freedom. These are called the component fields of the superfield. The product of 
two chiral fields also produces a chiral field. Therefore, any polynomial, W(®), can 
be used to construct a supersymmetry invariant as 


£- f d’OW (E) = Fw(o) (8.20) 


is a supersymmetry invariant. This is used to provide a potential for the chiral field. 
The kinetic terms are described by: 


| ees $;0; = 0;0; (8.21) 


0000 


® is an anti chiral field. After expanding and extracting the 6000 term one 
obtains (up to total derivatives): 


Dot 
F*F; — |a, A[ + gà Va" y (8.22) 
One has thus constructed the following Lagrangian: 


= 1 1 
LoS DiDi looga + [no + zis Pi Pj + 581010; | (8.23) 
00 


= iQ + AY x A; + F*F; +A; F; mij (AF; x TE) 


(8.24) 
+ gijk (AiAj Fk — Viv Ax) + h.c. 


One can eliminate the auxiliary fields F;, F* in favor of the fields carrying 
quantum degrees of freedom. The equation of motion for F% is as follows: 


Fk = AL + mj A; + eic Ai Aj (8.25) 
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L = dmi Ape Ai — mi; — mtt T wo 

— &gikViVjAk — Si Vi j — Ak — F Fi f 
where the last term leads after integration on the Fjs to a potential for the fields A,A*; 
these are known as the F terms, Vr(A*,A). (Note, Vr > 0). It turns out as explained 
later that at the ground state this must vanish i.e. Vr(A*,A) = 0 if SUSY is not to 
be spontaneously broken. This in turn implies that F; = 0 for such a symmetric 
ground state. Although this is a classical analysis so far, in fact it is true to all orders 
in perturbation theory as there exists a non-renormalization theorem for such an 
effective potential in SUSY theories. The Lagrangian described above is called the 
Wess-Zumino Lagrangian (WZ). 

So far the scalar fields have been defined over simple flat manifolds. To describe 
the kinetic terms of supersymmetric Lagrangians of systems containing scalar 
fields spanning complicated manifolds it is convenient to introduce the following 
supersymmetry invariant: 


[vox (6,9). (8.27) 


K (9, @ ) is called the Kahler potential, unlike the potential W(®) but similar to 
the kinetic term introduced above, the Kahler potential depends on both ® and @. 
One may add any function of ® or @ to the integrand since these terms will vanish 
after integration. For the usual kinetic terms, K is taken to be given by K = bọ 
which produces the —ô;jð nA“ 8" AJ kinetic terms for the scalars. For the case of a 
sigma model with a target space whose metric is g;j; this metric is related to the 
Kahler potential by: 


3K 


TALJ VES) 


Sij 


The above supersymmetry invariant (8.27) which previously gave the usual 
kinetic terms in the action, produces for general K the action of a supersymmetric 
sigma model, with the target space metric given by Eq. (8.28). 


8.2.2.2 Global Symmetries 


It is possible to construct Lagrangians which have a global symmetry which does 
not commute with supersymmetry and thus assigns different quantum numbers 
to particles in the same supermultiplet. This symmetry already in its discrete 
form forbids unwanted interaction terms which strongly violate baryon and lepton 
conservation laws. Such interactions arise due to the bosonic superpartners to 
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standard model particles carrying Baryonic and Leptonic numbers. The symmetry 
also in its continuous U(1) version turns out to play a possible role in the possibilities 
to spontaneously break SUSY. It is called R symmetry. 

R-symmetry is a global U(1) symmetry that does not commute with the super- 
symmetry. Its discrete version is called R parity. The action of the R-symmetry on a 
superfield with R-character n as follows. 


Ré (0,x) = expQina) e (exp (—ia0) , x) (8.29) 


R (8, x) = exp(—2ina)® (exp (ia) ; x) (8.30) 
Since the R-charge does not commute with the supersymmetry, the component 
fields of the chiral field have different R-charges. For a superfield with R-character 


n, the R-charges of the component fields may be read off as follows: 


R (lowest component of ®) = R(A) =n, R(y) 2n—1, R(F) 2n—2 
(8.31) 


The R-charge of the Grassmann variables is given by: 
R (04) = 1, R (d@,) = —1 (8.32) 


with, barred variables having opposite R charge. The kinetic term $4 is an R 
invariant. (0000 is an invariant.) For the potential term, 


f dow (8.33) 


to have zero R charge requires that R(W) = 2. For the resulting mass term from 
W= jm $?, 


my y -- m^|AP, (8.34) 


to have vanishing R-charge requires 


R(@) = R(A) = 1, R(wW) = 0 (8.35) 


Adding the cubic term: 


A 
W3 = go (8.36) 
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produces 
V = AP AI* + AAv y. (8.37) 


This term is not R-invariant with the R-charges given by (8.35). To restore R- 
invariance requires À is assigned an R-charge of —1. This can be viewed as simply 
a book keeping device or more physically one can view the coupling as the vacuum 
expectation value of some field. The expectation value inherits the quantum numbers 
of the field. This is how one treats for example the mass parameters of Fermions 
in the standard model. There is also one other global U(1) symmetry, one that 
commutes with the supersymmetry. All component fields are charged the same with 
respect to this U(1) symmetry. Demanding that the terms in the action maintain this 
symmetry requires an assignment of U(1) charges to A, and m. 

The charges are summarized in the following table: 


U(1) U(l)n 
$ | 1 
m —2 0 
A —-3 -1 
W O0 2 


These symmetries can be used to prove important nonrenormalisation theorems. 
In particular it can be shown that the potential: 


1 1 
W zm p? + she. (8.39) 


does not change under renormalization. 

These non-renormalization theorems play an important role in analyzing the 
dynamics of supersymmetric systems and in addressing the so called hierarchy 
problem. 


8.2.2.3 Lagrangians for SUSY Gauge Theories 


A vector superfield contains spin 1 and spin 5 component fields. It obeys a reality 
condition V = V. 


V = B+O0x0x+402C +8 C — 00"9A,, 
+ i828 (X - 55" ,x) — 186 Q 2 Jordz) (8.40) 
+ 1020 (D? + à? B) 
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B,D, A, are real and C is complex. The Lagrangian has a local U(1) symmetry 
with a gauge parameter, A an arbitrary chiral field: 


V>V+i(A-A) (8.41) 


B, x, and C are gauge artifacts and can be gauged away. The symmetry is actually 
U(1)c as opposed to the usual U(1)g because although the vector field transforms 
with a real gauge parameter, the other fields transform with gauge parameters that 
depend on the imaginary part of A. 

It is possible to construct a chiral superfield, Wg, from V as follows 


]—— = 
Wa = — Uu DES D¿Wa = 0 (8.42) 


One may choose a gauge (called the Wess Zumino gauge) in which B, C and x 
vanish and then expand in terms of component fields, 


VO) = -66"0A, + 1070 — i 0A + 1990 D (8.43) 


Where A, is the vector field, F,» its field strength, A is the spin 5 field and D is 
an auxiliary scalar field. Under the symmetry (8.41), the component fields transform 
under a now U(1)g symmetry as: 


Ay > Ay — id, (B — B*),à > X4,D— D (8.44) 


Note, W is gauge invariant. The following supersymmetric gauge invariant 
Lagrangian is then constructed: 


L= fè ZT | wew, + h.c. (8.45) 
16x 
where the coupling constant t is now complex, 
t= —+i— (8.46) 


Expanding this in component fields produces, 


1 1 Tm 
L= — pague %0 Di + (&F)"" Fuy. (8.47) 


~ 4g 2g2 3272 
D is a non-propagating field. The 0 term couples to the instanton number density 


(this vanishes for abelian fields in a non-compact space). A monopole in the 
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presence of such a coupling will obtain an electric charge. The supersymmetries 
acting on the component fields are, (up to total derivatives): 


&A = J2ey 
dep = iV20"€0,A + V2€F 
&F = iv" ð yy 
bcFuy = i (cop ðv - €0,8,A) — (u < v) 
6A = ieD +o”eFuv 
êD = t6"9,4 — eo 0,X. 


(8.48) 


One may also add to the action a term linear in the vector field V, known as a 
Fayet-Iliopoulos term: 


2K [ odav =KD= / d0* W, + h.c. (8.49) 


It plays a possible role in the spontaneous breaking of SUSY. The U(1) gauge 
fields couple to charged chiral matter through the following term 


Bey f d^0d?8, exp (qi V) Di (8.50) 


Under the gauge transformation 
V—V-ci(A- 4).9; > exp(-iq; A) 9; (8.51) 


Since there are chiral Fermions there is the possibility for chiral anomalies. In 
order that the theory is free from chiral anomalies one requires: 


X qi=) = 0 (8.52) 


Writing out the term (8.50) in components produces: 


2 ; " 
£-F*F- - ijo @ + da^.) y — a (pry — pay) 


iq 
940 + 5 Aub 


There are two auxiliary fields, the D and F fields. i 
Adding the kinetic term (8.47) for the vector field and a potential, W (®) for the 
matter, gives the total Lagrangian, 


L= [ve (wow. «fem @ exp (qiV) Di + ie») (8.54) 
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this produces the following potential, 


í + T((x +5 ei) (8.55) 


So far only systems with U(1) vector fields were discussed. One can also 
consider non-Abelian gauge groups. The fields are in an adjoint representation of 
the group, A^,,A^, D^, the index a is the group index, (a = 1...dim(group)) and 


D* = DT ho) %- 


8.2.3 Supersymmetrical Particle Spectrum in Nature? 


The classification of the particles was naturally followed by an attempt to correlate 
the known particles with the algebraic results. The photon is massless to a very good 
approximation, the only known fermionic particle that at the time was considered 
massless as well was the neutrino. It was found that in the standard models the two 
cannot be members of the same multiplet. As SUSY is broken, it was at the time 
expected that the breaking be spontaneous and in case of a global symmetry this 
would lead to a massless spin one half Goldstone particle, a spin half fermion in 
the case of broken SUSY. It would have also been nice and simple if the neutrino 
would be at least the Goldstone fermion or as it has become termed a Goldstino. 
This turned out not to be consistent with experiment as well. Eventually one got 
resigned to the situation that all known particles, be they bosons or fermions, have 
supersymmetric partners which are yet to be discovered. The yet to be confirmed 
spin zero elementary particle, the Higgs, has a spin one half superpartner—the 
Higgsino. In fact in a supersymmetric model extension of the standard model at least 
two Higgs fields are required. The SUSY interaction terms can each be composed 
only of chiral, or only antichiral fields. For such interaction terms a single Higgs 
field would not permit to construct the Yukawa interaction terms needed to provide 
masses to all quarks and leptons. Also with only one field the theory would not 
be consistent as it would suffer what is called an anomaly. The particles carry a 
U(1) gauge charge, and in the presence of a single Higgsino that gauge symmetry 
would become invalid once quantum corrections are taken into account. An extra 
Higssino, and thus an extra Higgs, is required to restore the gauge invariance at the 
quantum level. The two mentioned problems get to be resolved by adding the one 
extra Higgs supermultiplet. The superpartners of the known spin one half quarks and 
leptons are denoted squarks and sleptons and are required to be spin zero bosons. 
The superpartners of the various spin one known gauge particles are termed the 
photino, wino, zino and gluino. They are assagined spin one half and are in the 
adjoint representation of the gauge group. In the LHC a major effort is planned for 
observing these particles. 
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8.2.4 Spontaneous SUSY Breaking: Perturbative Analysis 


Any attempt to relate supersymmetry to nature at the level of the known particles 
requires the symmetry to be broken. It could have been broken explicitly, in 
this section the currently known mechanisms for its spontaneous breaking are 
described. This includes the breaking at the classical level in a class of gauge 
theories (by D terms) and in theories with no gauge particles (by F terms). Also 
are described the dynamical breaking of Supersymmetry as well as its effective 
breaking in a metastable vacuum. For SUSY not to be spontaneously broken all 
the SUSY generators Q',, need to annihilate the ground state. As the Hamiltonian 
is constructed out of positive pairings of the SUSY generators, SUSY preservation 
occurs if and only if the energy of the ground state vanishes. Conversely, SUSY is 
broken if and only if the energy of the ground state does not vanish, On | G.S. >Æ 
0 iff Eg s. # 0. As the Hamiltonian of the SUSY system is non-negative, the non- 
vanishing ground state energy in the case spontaneous SUSY breaking is positive. 

As for the actual mechanism for the spontaneous breaking, it turned out that the 
breaking of supersymmetry requires a somewhat elaborate structure. 


8.2.4.1 F-terms 


Consider first a system which contains only spin zero and spin one-half particles. In 
that case the condition for SUSY not to be broken is the vanishing of the potential 
generated by the F terms. Super Symmetry is thus unbroken when the following 
equations have a solution: 


Vr =0 € Fi -0 Vi, (8.56) 


These are n (complex) equations with n (complex) unknowns. Generically, they 
have a solution. Take the example of the one component WZ model, where 


Fi 2 —A— mA + g4’. (8.57) 


This has a solution. There is no supersymmetry breaking classically. The non- 
renormalization theorem for the F terms ensures this result to be correct to all orders 
in perturbation theory. The solution is: 


V = A* A(gA — m)? (8.58) 


and hence there are actually two supersymmetric vacuum: either at « A » — m/g or 
at < A» — 0. 

Let us examine now how supersymmetry may be spontaneously broken. The 
following anecdote may be of some pedagogical value. It turns out that a short time 
after supersymmetry was introduced arguments were published which claimed to 
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prove that supersymmetry cannot be broken spontaneouly at all. Supersymmetry 
resisted breaking attmepts for both theories of scalars and gauge theories. One could 
be surprised that the breaking was first achieved in the gauge systems. This was done 
by Fayet and Illiopoulos. The presence in the collaboration of a student who paid 
little respect to the general counter arguements made the discovery possible. Fayet 
went on to discover the breaking mechanism also in supersymmetric scalar theories 
as did O'Raighfeartaigh. 

We will describe four examples of mechanisms of breaking Super Symmetry. 
The first will be for theories not containing gauge particles. The second for systems 
containing gauge particles, the third is of a dynamical breaking of Super Symmetry 
and the fourth occurs if our universe happens to be in a long lived metastable vacuum 
of positive kinetic energy. 


8.2.4. SUSY Breaking in Theories with Scalars and Spin One Half 
Particles by F Terms 


The Fayet-O’ Raifeartaigh potential contains three fields this is the minimal number 
needed in order to break supersymmetry. It is: 


Lpotential = APo + m4» + g690, Pı -F h.c. (8.59) 


Minimizing the potential leads to the following equations: 


0 — A+ 24010, 
0 = mo» + 22094, (8.60) 
0 = mdi 


These cannot be consistently solved so there cannot be a zero energy ground state 
and supersymmetry must be spontaneously broken. To find the ground state one 
must write out the full Lagrangian including the kinetic terms in component fields 
and then minimize. Doing so one discovers that in the ground state A; = A2 = 0 
and Ao is arbitrary. The arbitrariness of Ao is a flat direction in the potential, the 
field along the flat direction is called a moduli. Computing the masses by examining 
the quadratic terms for component fields gives the following spectrum: the six real 
scalars have masses: 0,0,m2m2,m? + 2g; and the Fermions have masses: 0,2 m. 
The zero mass Fermion is the Goldstino. We turn next to breaking of supersymmetry 
theories that are gauge invariant. 


8.2.4.3 SUSY Breaking in Supersymmetric Gauge Theories 


The potential for the supersymmetric gauge theory was obtained to be: 


e 2 
$ © (aK +> id?) (8.61) 
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The first term is the F-term and the second is the D-term. Both these terms need 
to vanish for supersymmetry to remain unbroken. 

Some remarks about this potential are in order: 

Generically, the F-terms should vanish since there are indeed n equations for 
n unknowns. If < 9; > = 0, that is if the U(1) is not spontaneously broken then 
supersymmetry is broken if and only if Kg 1, ZZ 0. When K = 0 and the F-terms have 
a vanishing solution then so will the D-term and there will be no supersymmetry 
breaking. 

These ideas are demonstrated by the following example. Consider fields &,, ®2 
with opposite U(1) charges and Lagrangian given by: 


1 = = 
L= n (W* Wa + h.c.) + By exp(eV) D1 + Bo exp (—eV) 2 +m $414» + h.c. - 2KV 
(8.62) 
This leads to the potential: 
1 2 * * 
V= ;P TFEPFTFjE (8.63) 
where 


Dt K-5(ATAi — A34?) 20 
Fi mA», =0 (8.64) 
Fə + mA} =0 


Leading to the following expression for the potential: 


1 1 1 1 
V= ak + (r + eK) ATA, + (ve - s) A342 + gc (Ati — Až A2)” 
(8.65) 


Consider the case, m? > leK. The scalars have mass, ,/m2 + leK and 
2 2 


Jm? — ieK . The vector field has zero mass. Two Fermions have mass m and 
one Fermion is massless. Since the vector field remains massless then the U(1) 
symmetry remains unbroken. For K Z 0, supersymmetry is broken as the Bosons 
and Fermions have different masses. (For K — 0 though the symmetry is restored.) 
The massless Fermion (the Photino) is now a Goldstino. Note that a trace of the 
underlying supersymmetry survives as one still has TrM?g = TrM?; even after 
the breaking of supersymmetry. Mg and Mf are the bosonic and fermionic mass 
matrices respectively. 


Next, consider the case, m2 


< ieK; at the minimum, A; = 0,4» = v where 
1 2 
eK —m 
v? 242 7—. 
e 
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The potential expanded around this minimum becomes, with A = A; and A= 
A2 — v: 


€ 
+ 2m7A*A = (4e?) (4 (A ro) (8.66) 


The first term implies that supersymmetry is broken for m > 0. The photon is 


massive, m, = ies implying that the U(1) symmetry is broken as well. The 


ENTE ND 
Higgs field, (å + Ar) has the same mass as the photon. Two Fermions have 


non-zero mass and there is one massless Fermion, the Goldstino. 
In the above example there is both supersymmetry breaking and U(1) symmetry 
breaking except when m = 0 in which case the supersymmetry remains unbroken. 
Next consider a more generic example where there is U(1) breaking but no 
supersymmetry breaking, ® is neutral under the U(1) while P has charge +1 and 
®— has charge —1. The potential is given by: 


1 
L= zmo? +uL- +ADD D +h.c. (8.67) 


There are two branches of solutions to the vacuum equations (a denotes the 
vacuum expectation value of A, etc.): 


—A 
aa- =0, a = —— (8.68) 
m 
which leads to no U(1) breaking and 
1 
cce (.-“). pack (8.69) 
8 g 8 
which breaks the U(1) symmetry. 
Note, the presence of a flat direction: 
a} > ea}, a- — e ??- (8.70) 


leaves a—a+ fixed and the vacuum equations are still satisfied. 

Typical generic supersymmetry breaking requires that some of the equations 
for the vanishing of the potential to be redundent. Such could be the case if the 
system had an extra symmetry such as an R symmetry mentioned before. In the 
examples above containing only scalar fields this was indeed the case. The absence 
of a zero energy solution led also to a spontaneous breaking of the R-symmetry. 
This should lead to the presence of a Goldstone Boson corresponding to the broken 
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U(1)pg. Inverting this argument leads to the conclusion that supersymmetric breaking 
in nature is not easy to obtain since we do not observe even such a particle. This 
argument was revisited in recent years. 

So far only systems with U(1) vector fields were discussed. To have a non- 
vanishing FayetIliopoulos term there needs to be a U(1) factor in the gauge group. 
To obtain a breaking of SUSY in a gauge system with does not have a U(1) factor 
one needs to consider dynamical SUSY breaking. This is a more complex problem 
and new tools were developed to explore this possibility. The functional form of 
such a breaking allows to obtain a scale of SUSY breaking which is naturally much 
smaller than the relevant cutoff of the problem. This is reflected by the relation: 


MSUSY-breaking = Meutotf eXp (—c/8 (Mcutott)) - (8.71) 


6.2.5 Dynamics of SUSY Gauge Theories and SUSY Breaking 


The description of the mechanism of dynamical breaking of SUSY is preceded by 
a survey of the general possible phase structure of gauge theories as well as its 
concrete realizations in SUSY gauge theories. 


8.2.5.1 Phases of Gauge Theories 


The phase structure of gauge theories can be introduced by analyzing them as 
statistical mechanical systems regulated by a finite lattice. The basics can be 
illustrated by considering D — 4 lattice gauge theories, in particular those for which 
the gauge fields which are Zy valued. The system has a coupling g. 

The effective "temperature" of the system is given by, T — n 

For a given theory there is a lattice of electric and magnetically charged operators. 
The electric charge is denoted by n and the magnetic charge by m. An operator with 
charges (n, m) is perturbative, i.e., it is an irrelevant operator and weakly coupled to 
system, so long as the free energy, F » 0, i.e., 


2 
2 m C 
T + — > —, 8.72 
n +7 FH (8.72) 


however, when the free energy is negative for the operator (n,m), it condenses 
indicating the presence of a relevant operator and hence an infra-red instability, this 
occur when, 


2 
2 m C 
T+—<—. 8.73 
nU T s N ( ) 


Keeping N,C fixed and vary T. 
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Fig. 8.1 Different possible phases 


The system has three phases depending which operators condense. At small T, 
there is electric condensation which implies that there is electric charge screening, 
magnetic charges are confined, and the log of the Wilson loop is proportional to the 
length of the perimeter of the loop. (This is called the Higgs phase). 

At high T, magnetic condensation occurs, this is the dual of electric condensation. 
Magnetic charges are screened, electric charges are confined and the log of the 
Wilson loop is proportional to the area. (This is called the confinement phase.) For 
intermediate values of T it is possible that there is neither screening of charges nor 
confinement, this is the Coulomb phase (Fig. 8.1). 

In the presence of a 0 parameter, an electric charge picks up a magnetic charge 
and becomes dyonic. 


n =n+ La (8.74) 
2x 
This lead to a tilted lattice of dyonic charges and one may condense dyons with 
charges (719,mo). This leads to what is called oblique confinement with the charges 
commensurate with (719,719) being screened and all other charges being confined. 
These ideas relate to the gauge theories of the standard model. For QCD it 
was suggested that confinement occurs due the condensation of QCD monopoles. 
This is a magnetic superconductor dual to the electric one which describes the 
weak interactions. It is difficult to study this phenomenon directly. The Dirac 
monopole in a U(1) gauge theory is a singular object; however by embedding the 
monopole in a spontaneously broken non-Abelian theory with an additional scalar 
field one may smooth out the core of the monopole and remove the singularity. 
One may proceed analogously, by enriching QCD; adding scalars and making the 
theory supersymmetric one can calculate the condensation of monopoles in a four 
dimensional gauge theory. 
In general the possible phase structure of gauge theories and their actual realiza- 
tions were obtained by using various approximation schemes. While supersymmetry 
has not yet disclosed if it part of nature in some form or another, in its presence, the 
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Fig. 8.2 Possible phases of Y 
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phase structure of gauge theories was exactly obtained in some cases. Most earlier 
exact important results in field theories, such as asymptotic freedom, were obtained 
in circumstances in which the couplings are weak. SUSY enables to obtain also 
results in the strong coupling regime. The analytic control seems to become larger 
the more supersymmetries the system possesses. This has been achieved in four 
dimensions for supersymmetric gauge theories with N = 1,2,4 supersymmetries. 
There are many new methods that have been utilized and the phase structures of 
these theories have been well investigated. Novel properties of these theories have 
been discovered such as new types of conformal field theories and new sorts of 
infra-red duality. To these we turn next (Fig. 8.2). 


8.2.5.2 SUSY QCD: The Setup 

The goal will be to examine theories that are simple supersymmetric extensions of 
QCD. Consider the case of a N = 1 vector multiplet with gauge group SU(N), and 
Nr chiral multiplets in the fundamental representation of SU(Nc), and Nr chiral 


multiplets in the antifundamental representation. The Lagrangian is: 


L = f (-it)TrW*W,d70 + h.c. 


2 " 8.75 
+ Qrexp(-2V) Qr + QexpQV) OF lagoo + mr OF Oloo s 


where the coupling is: 


t=—+i— (8.76) 
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Apart from the local SU(Nc) gauge symmetry, the fields are charged under the 
following global symmetries. 


SU(Nr)z x SU(NF)R x U(Dy x U(1)4 x U(Dnc 


Qi Nr 1 1 1 1 
OF l Ny ESI 1 1 
Wy 1l 1 0 0 1 


When Nc = 2, because 2 ~ 2, the global flavor symmetry is enhanced to 
SOQNr), x SOQNr)n. 

There is an anomaly of the U(1)4 x U(1)g symmetry. A single U(1) symmetry 
survives the anomaly. This is denoted as U(1)s and is a full quantum symmetry. 
The adjoint Fermion contributes 2Nc x R(A) to the anomaly. The Chiral Fermions 
contribute, 2Nr x Rr. R(A) = 1, while Rr is now chosen so that the total anomaly 
vanishes, 


2Nc -2RpgNr — O0. (8.77) 
This leads to 
Nc 
Rp = -— 8.78 
F NT. (8.78) 


The Bosons in the chiral multiplet have an R-charge one greater than the 
Fermions in the multiplet. Thus, 


Nc | Nr-— Nc 


Rg=1- 8.79 
B Ne We (8.79) 
The non-anomalous R-charge, is given by: 
Nc 
R= Rc — — Qa, (8.80) 
Nr 


where Rc is the classical R-charge and A is the classical U(1),5A charge. Following 
non-anomalous global charges: This leads to the 


SU(QNrE), x SU(NF)R x U(Dy x U(l)n 


Q, Nr e: 1 Ne AC 
of Nr -1 Ae -Ne 
Wa 1 1 0 1 


One is now ready to identify the classical moduli space. 
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8.2.5.3 The Moduli Space 


The classical moduli space is given by solving the D-term and F-term equations: 


D' = QT" Qr — OrT^ Oy 
For = —mFQ (8.81) 
Fog co cR 


For Nr = 0 or for Nr Æ 0 and mpr ¥ 0, there is no moduli space. Note, the vacuum 
structure is an infra-red property of the system hence having mr Æ 0 is equivalent 
to setting Nr = 0 in the deep infrared. 

Consider the quantum moduli space of the case where Nr = 0. One can show that 
the number of zero energy states of the system is no smaller than The Witten index, 
Tr(—1)* = Nc i.e. the rank of the group +1. This number is larger than zero and 
thus there is no supersymmetry breaking in these systems. There are 2Nc Fermionic 
zero modes (from the vector multiplet). These Fermionic zero modes break through 
instanton effects the original U(1)g down to Zowc. Further breaking occurs because 
the gluino two point function acquires a vacuum expectation value which breaks the 
symmetry down to Z2. This indeed leaves Nc vacua. The gluino condensate is: 


< AA >= exp on a (8.82) 
Nc G 


where Aywc is the dynamically generated scale of the gauge theory and 
k = L...,Nc — 1 label the vacua. Chiral symmetry breaking produces a mass 
gap. Note, because chiral symmetry is discrete there are no Goldstone Bosons. 
Further details of quantum moduli spaces will be discussed later. 

Consider the case where mr = 0 and 0 < Nr < Nc. The classical moduli space 
is determined by the following solutions to the D-term equations: 


(8.83) 


oO ocoooo 


0... 

. 0... 
Q=Q= . 0... 
0... 

0...0) yeye 


Where the row indicates the flavor and the column indicates the colour. There are 
Nr diagonal non-zero real entries, aj. (To validate this classical analysis the vacuum 
expectation values must be much larger than any dynamically generated scale, i.e., 
dj >> A. The gauge symmetry is partially broken: 


SU (Nc) > SU (Nc — Nr). (8.84) 
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This is for generic values of a;. By setting some subset of a; to zero one may break 
to a subgroup of SU(Nc) that is larger than SU(Nc — Nr). Also, if Nr = Nc — 1 
then the gauge group is complete broken. This is called the Higgs phase. 

The number of massless vector Bosons becomes 


Nc? (uc Np)? 1) =2NcNF Ne, (8.85) 


the number of massless scalar fields becomes, 


2NcNr — (2NcNr = Ne) = N2. (8.86) 
The matrix 
Mi; = 0;0; (8.87) 


forms a gauge invariant basis. The Kahler potential is then, 


K = 2Tr,/(MM). (8.88) 


When singularities appear, i.e. detM = 0, it signals the presence of massless 
particles as well as of enhanced symmetries. 
When Nr > Nc, one has the following classical moduli space, 


a, O a, 0 
0 a2 0 a 
Q= |: , Õ= i (8.89) 
aNc Nc 
0 0 0 0 0 0 
3 ud : 0 
00-- 0 T 00-- 0 NNG 
with the constraint that 
lai? — lã? = p. (8.90) 


Generically the SU(Nc) symmetry is completely broken. However, when 
ai = áj = 0 then a subgroup of the SU(Nc) can remain. 
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We will now consider some special cases, first the classical moduli space for 
Nr = Nc. The dimension of the moduli space is given by: 


2Nc? — (Nc? - 1) = Ne? 418 Ng? +1 (8.91) 


There are Np? degrees of freedom from M-;; and naively one would have two 
further degrees of freedom from: 


. i E ~; ~i 
B = éi ine Qh Qiyo Boca OF --- Oi (8.92) 


"n7 inp 
There is, however, a classical constraint: 


det M — BB =0 (8.93) 


which means M, B and B are classically dependent. This leaves only Nz +1 
independent moduli. 

Generically, as well as the gauge symmetry being completely broken, the global 
flavor symmetry is also broken. There is a singular point in the moduli space where 
M=0=B=B. 

Next, consider the case, Nf = Nc + 1, again there are Nz. moduli from M; i: 
There are also, 2(Nc + 1) degrees of freedom given by: 


inc ANC 


Bi = €ii,.. iw. Q5. jeg Qj. B; = €i, inc Q5 Sai Q ivp (8.94) 
However, there are again the classical constraints: 
deM — M;, Bi Bi =0 
1J (8.95) 
M;Bi = M;;B;= 0 


giving again an NZ + 1 dimension moduli space. (The moduli space is not 
smooth). There is a generic breaking of gauge symmetry. In all these cases 
the potential has flat directions of zero energy, SUSY is unbroken. The non- 
renormalization theorem extends this result to all order in perturbation theory, it 
will be the non-perturbative effects which will lift the vacuum degeneracies in some 
cases and will lead to dynamical breaking of SUSY. 


8.2.5.4 Quantum Moduli Spaces/Dynamical SUSY Breaking 


A rich structure emerges. One is required to examine on a case by case basis the role 
that quantum effects play in determining the exact moduli space. Quantum effects 
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Fig. 8.3 Potential for V 
1 < Nr < Nc, it has no 
ground state 


det [M] 


both perturbative and nonperturbative can lift moduli. In what follows the quantum 
moduli space is examined for the separate cases: 1 


3N 
< Nr < Nc - 1, Nr = Nc. Nr, Nc +1, Nc +1 < Nps, 


3Nc 
> < Nr <3Nc,Nr =3cN,and Np > 3Nc. 


We start with the study of the quantum moduli space for 0 < Nr < Nc. 
Classically, the dimension of the moduli space is N2 from Q, Q. The following 
table summarizes the charges under the various groups. 


SU (Nc) SU(Nr)z SU(Nr)r UDy UD, UDR, || U(g 


Qi Nc Nr 1 1 1 1 Newe 
Q^ Nc 1 Ne -1 1 1 MAC 
Qi ec m 4 1 1 0 2NF 2Nc 0 
M 1 Nr Nr 0 2 2 - AS 
det M 1 1 1 0 2Nr 2Nr 2(Nr- NC) 


A, the dynamically generated QCD scale is assigned charges as m and g were 
before. The power 3Nc — Nr is the coefficient in the one loop beta function. There 
is no Coulomb phase so W, does not appear. 

The symmetries imply, the superpotential, W, has the following form: 


we (Arete) det Me (8.96) 


a, b are to be determined. c is a numerical coefficient, If c does not vanish, the 
classical moduli space gets completely lifted by these nonperturbative effects (Fig. 
8.3). 
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One examines the charges of W under the various symmetries. Automatically, the 
charges of W for the flavor symmetries, SU(N), x SU(Nr)g and the U(1)y vanish. 


If one requires the U(1), charge to vanish then this implies a = —b. Requiring 
the U(1)s charge to vanish implies that b = Won: These restrictions fix: 
A3Nc-Nr NENF 
W = c| ———— 8.97 
«( det M ) cm) 


For non-vanishing c, all the moduli are now lifted and there is no ground state. 

What is the value of c? This is a difficult to calculate directly unless there is 
complete higgsing. For Nr = Nc — | there is complete symmetry breaking and one 
can turn to weak coupling. From instant on calculations one calculates that c 4 0 
and the prepotential for the matter fields is 


A2Nct1 
W~ ( ) . (8.98) 
det M 
One may now go to Nr « Nc — 1 by adding masses and integrating out the heavy 
degrees of freedom. This produces 


i L 
« M i min = (m) (An det m) Me. (8.99) 


J 


Thus in this region the dynamical effects remove the supersymmetric vaccum. In 
fact the system has no ground state and supersymmetry is broken dynamically. One 
can go one step further and consider the possibility that the effective potential above 
is modified so as to have a local minimum with positive vacuum energy. It is possible 
to construct many such examples and consider that supersymmetry seems broken 
as a result of the universe being for the (long) time being in the metastable state. 
Eventually the system may tunnel to another lower energy vacuum and perhaps 
eventually will reach a SUSY vacuum. In the analysis of such systems once needs 
also to take into account gravity. In the cases which follow supersymmetry is not 
broken, they are described to illustrate the rich structure which emerges once one 
can treat the system when it strongly interacts. Perhaps also structures like those of 
the dyonic condensates will yet play a role somewhere in nature (Fig. 8.4). 

We now turn to study the dynamics and quantum moduli space for the Nr > Nc 
case. In this situation there is a surviving moduli space. In the presence of a mass 
matrix, mj; for matter one obtains 


< Mij >= (m=) (A m)”. (8.100) 


J 
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Fig. 8.4 Potential with finite Va 
masses has a ground state. 
Mmin > oo as m — 0 


min 


Previously, for the case of Nr « Nc, it turned out that m — 0 implied « M' j^ 
oo thus explicitly lifting the classical moduli space. For Nr > Nc it is possible to 
have m — 0 while keeping < Mi; > fixed. 

Consider the case where Nr — Nc. Quantum effects alter the classical constraint 
to be: 


det M — BB = A?NC, (8.101) 


This has the effect of resolving the singularity in moduli space. The absence of a 
singularity means there will not be additional massless particles. 

The physics of this theory depends on the position in moduli space of the vacuum. 
For large, M/ B/ B one is sitting in the Higgs regime; however, for small M/B/ B 
one is in the confining regime. Note that M cannot be taken smaller than A. As 
the system has particles in the fundamental representations there is no actual phase 
transition between these two regimes, the transition is of a quantitative nature. 
In addition global symmetries need to be broken in order to satisfy the modified 
constraint equation. 

Consider some examples: with the following expectation value, 


< Mi; >= jA, « BB >=0, (8.102) 


the global symmetries are broken to: 


SU(NF)y x Ug(1) x Un(1), (8.103) 
and there is chiral symmetry breaking. When, 


<M';>=0, <BB>#0 (8.104) 
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then the group is broken to: 


SU(Nr), x SU(Nr)g x Ur() (8.105) 


which has chiral symmetry and also has confinement. This is an interesting situation 
because there is a dogma that as soon as a system has a bound state there will be 
chiral symmetry breaking (Fig. 8.5). 

The dynamics for the case Nr = Nc + 1 brings about some different dynamics. 
The moduli space remains unchanged. The classical and quantum moduli spaces 
are the same and hence the singularity when M = B = B = 0 remains. This is not 
a theory of massless gluons but a theory of massless mesons and baryons. When, 
M, B, B # 0 then one is in a Higgs/confining phase. At the singular point when, 
M = B = B = O there is no global symmetry breaking but there is “confinement” 
with light baryons. 
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Fig. 8.6 Two systems with a System I System II 
different ultra-violet behavior 

flowing to the same infra-red 

fixed point. 


There is a suggestion that in this situation, M, B, B become dynamically 
independent. The analogy is from the nonlinear sigma model, where because of 
strong infrared fluctuations there are n independent fields even though there is a 
classical constraint. The effective potential is: 


Weg (ar j BiB! — det M) (8.106) 


= ANNC- 
the classical limit is taken by: 


A—0 (8.107) 


which in turn imposes the classical constraint. 

For higher values of the number of flavors Nr the plot thickens even more as new 
duality emerge. Infrared dualities which in some circumstances uncover new types 
of dynamical structures. 


8.2.5.5 Infra-red Duality 


Two systems are called infra-red dual if, when observed at longer and longer length 
scales, they become more and more similar (Fig. 8.6). 

It has been observed that the following set of N — 1 supersymmetric gauge 
theories are pairwise infra-red dual called Seiberg dualities. 


l System Dual System : l l 
Gauge Group #flavors Gauge Group #flavor #singlets 
SU(Nc)SO(Nc)Sp(Nc) | NFNF2NF |SU(NF — Nc) | NFNF2NF | N2NZN2 


SO(Nr — Nc + 4) 
[SPNF — Nc — 2) 


For a given number of colors, Nc, the number of flavors, Nr, for which the 
infrared duality holds is always large enough so as to make the entries in the table 
meaningful. Note that the rank of the dual pairs is usually different. Lets explain why 
this result is so powerful. In general, it has been known for quite a long time that 
two systems which differ by irrelevant operator have the same infra-red behavior. 
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In these cases the UV structure of the Infrared dual theories is very different, 
the dual systems have a different numbers of colors. The common wisdom in 
hadronic physics has already identified very important cases of infra-red duality. 
For example, QCD, whose gauge group is SU(Nc) and whose flavor group is 
SU(NF) x SU(Nr) x U(1), is expected to be infra-red dual to a theory of massless 
pions which are all color singlets. The pions, being the spin-O Goldstone Bosons 
of the spontaneously broken chiral symmetry, are actually infra-red free in four 
dimensions. We have thus relearned that free spin-O massless particles can actually 
be the infra-red ashes of a strongly-interacting theory, QCD whose ultraviolet 
behavior is described by other particles. By using supersymmetry, one can realize 
a situation where free massless spin-4 particles are also the infra-red resolution 
of another theory. This duality allows for the first time to ascribe a similar role 
to massless infra-red free spin-1 particles. Massless spin-1 particles play a very 
special role in our understanding of the basic interactions. This comes about in the 
following way: Consider, for example, the N — 1 supersymmetric model with Nc 
colors and Nr flavors. It is infra-red dual to a theory with Ne — Nc colors and 
Nr flavors and Nz color singlets. For a given Nc, if the number of flavors, Nr, 
is in the interval Nc + 1 < Nr « MC, the original theory is strongly coupled 
in the infra-red, while the dual theory has such a large number of flavors that it 
becomes infrared free. Thus the infra-red behavior of the strongly-coupled system 
is described by infrared free spin-1 massless fields (as well as its superpartners), i.e. 
infrared free massless spin-1 particles (for example photons in a SUSY system) 
could be, under certain circumstances, just the infra-red limit of a much more 
complicated ultraviolet theory. This is the first example of a weakly interacting 
theory in which spin one particles that in the infra-red may be viewed as bound 
states of the dual theory. The duality has passed a large number of consistency 
checks under many circumstances. The infrared duality relates two disconnected 
systems. From the point of view of string theory the two systems are embedded in a 
larger space of models, such that a continuous trajectory relates them. An additional 
new consequence of this duality follows for the case 


3Nc 


the two dual theories are both asymptotically free in the UV and are describe 
by the same nontrivial conformal field theory in the infrared. The panorama of 
these structures is given in Fig. 8.8. Finally a class of examples was found among 
N = 2 SUSY conformal systems for which there are two very different Lagrangian 
descriptions as far as the local symmetries are involved which are actually identical 
at all distance scales. 
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8.2.5.6 More General Matter Composition of SUSY Gauge Theories 


One can enrich the structure of the theory by adding Na particles in the adjoint 
representation. At first one has no matter in the fundamental representation and 
scalar multiplets which are adjoint valued. The potential for the scalars, @; is given 
by: 


V = (lg, 92. (8.108) 


This potential obviously has a flat direction for diagonal ọ. The gauge invariant 
macroscopic moduli would be Trq*. Consider the non-generic example of Nc = 2 
and N, = 1, the supersymmetry is now increased to N = 2. There is a single complex 
modulus, Tre. Classically, SU(2) is broken to U(1) for Tro? Æ 0. One would expect 
a singularity at Try” = 0. The exact quantum potential vanishes in this case. 

Naively, one could have expected that when Tro? is of order A or smaller, one 
would expect that the strong infra-red fluctuations would wash away the expectation 
value for Trg? and the theory would be confining. The surprising thing is that 
when SU(2) breaks down to U(1), because of the very strong constraints that 
supersymmetry imposes on the system, there are only two special points in moduli 
space and even there the theory is only on the verge of confinement. Everywhere 
else the theory is in the Coulomb phase. At the special points in the moduli space, 
new particles will become massless. 

One can examine the effective theory at a generic point in moduli space where 
the theory is broken down to U(1). The Lagrangian is given by, 


= [ee Im (ten (Trg, g 4) waw”) (8.109) 


The Teff is the effective complex coupling which is a function of the modulus, 
Trg’, the original couplings and the scale, A. This theory has an SL(2,Z) duality 
symmetry. The generators of the SL(2,Z) act on r, defined by (8.76), as follows: 


1 
T> ——, T rt-«l (8.110) 
T 
This is a generalization of the usual U(1) duality that occurs with electromag- 


netism to the case of a complex coupling. Recall the usual electromagnetic duality 
for Maxwell theory in the presence of charged matter is: 


E— B,B——E,e—m,m — —e. (8.111) 


This generalizes to a U(1) symmetry by defining: 


E+iB,e+im. (8.112) 
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The duality symmetry now acts by: 
E +iB — exp(ia)(E+iB),e+im — exp(ia) (e - im). (8.113) 


For the SU(2) case the moduli are given by u — Tro?, for SU(Nc) the moduli are 
given by uy = Tre“, k = 2, ..., Nc. The classical moduli space is singular at times, 
there are no perturbative or nonperturbative corrections. 

The dependence of t on the moduli coordinate u was found. The vast number of 
results and literature on this will not be described here. Briefly: 

The following complex equation, 


y? 2 ax? 4 bx? +cx+d (8.114) 


determines a torus. The complex structure of the torus, Ttorus Will be identified with 
the complex coupling Teff. a,b,c,d are known holomorphic functions of the moduli, 
couplings, and scale, and so will implicitly determine T torus. 

When y(x) and y (x) vanish, for some value of x, t is singular. Therefore, 


and the effective coupling vanishes. This reflects the presence of massless charged 
objects. This occurs for definite values of u in the moduli space. These new massless 
particles are monopoles or dyons. The theory is on the verge of confinement. For 
N — 2 supersymmetry that is the best one can do. The monopoles are massless but 
they have not condensed. For condensation to occur the monopoles should become 
tachyonic indicating an instability that produces a condensation. One can push this 
to confinement by adding a mass term: m Trg?, or generally for SU(Nc) the term: 


8W = giu. (8.116) 


This breaks N — 2 supersymmetry down to N — 1. The effective prepotential is 
now: 


W = M (ux) qd  gkuk (8.117) 
then 
oW aw 
— =0, —— =05 M (Kuz >) =0, 0y,M (< uk >) «qq >= —gt 
Ou, ð (qq) 
(8.118) 


Since generically, 


Ou, M (< uk >) Z0 (8.119) 
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There will be condensation of the magnetic charge, confinement has been 
demonstrated to be indeed driven by monopole condensation. A monopole is usually 
a very heavy collective excitation. It is only the large amount of SUSY which allows 
one to follow the monople as it becomes massless and even condenses. 


6.2.6 Dynamics of SUSY Gauge Theories with N > 1 SUSY 
8.2.6.1 N = 4 Supersymmetry 


In the presence of this large amount of supersymmetry in four space-time dimen- 
sions the particle content was described in an earlier section. It consists of spin 
one, spin one half and spin zero particles. The particles are all in the adjoint 
representation of the gauge group. They fall into representations of the SU(4) global 
symmetry group as well. The full Lagrangian is fully dictated by the symmetry. The 
large symmetry leads to several properties which can be demonstrated. 


* The theory is scale invariant quantum mechanically. This was shown to all orders 
in perturbation theory as well as non-perturbatively. This served as an example 
of non-trivial four dimensional scale invariant theories. 

* The theory has thus a meaningful coupling constant on which the physics truly 
depends (unlike massless QCD in which dimensionless quantities do not depend 
on the coupling). Moreover the coupling can be complexified by adding the 0 
parameter. (In the absence of a chiral anomaly the theory truly depends also on 
0 even though massless fermions are present). The theory is invariant under the 
modular group SL(2,Z), this group relates in particular small and large values of 
the coupling. 

* The theory has flat directions along the scalar fields for any value of the coupling. 
The different points along the flat directions are not generically related by any 
symmetry. Each point characterizes a different vacuum choice for the system. 
These different vacua are called moduli. There is a special point in the moduli 
space and that is the point at the origin of field space where all scalar fields obtain 
a Zero expectation value. At that point the theory is realized in a scale invariant 
manner. The massless fields rendering the theory with a rather complex analytic 
structure. Choosing different vacua along the moduli space leads to different 
residual gauge symmetries, as the scalar fields are in the adjoint representations 
the residual gauge group is at least U(1)/ where r is the rank of the group. In each 
of these vacua the scale symmetry is spontaneously broken leading to a presence 
of a dilaton, the Goldstone Boson of broken scale invariance, in the spectrum. 
The vacuum energy is the same in all the phases associated with the different 
vacua choice. It has no dependence on any of the expectation values of the scalar 
fields. The spectrum includes massless and massive gauge particles. 

* In the broken phases of the theory the conditions are appropriate for the presence 
of various solitons in the system. In general they contain particles particles which 
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have both electric and magnetic charges, called dyons. The massess of many of 
these particles, called BPS states is protected by the large SUSY to the extent that 
the mass dependence on the coupling is exactly known. This has applications in 
the counting microscopically the entropy of some black holes. 

* Moreover, a hidden wish of theoreticians is that not only will one theory describe 
all physical phenomena but that that theory be exactly solvable. Given the 
complexity of four dimensional field theory this hope was suppressed. It had 
resurfaced when it was uncovered that quite a few properties of the N = 4 theory 
are exactly calculable. The system seems to have a large number of conserved 
quantities and this results in many so called integrability features. 

* As will be discussed in the section on string theory, there is a large body of 
evidence that N — 4 theory encodes in it the information of special string theories 
which include gravity and black holes. These are string theories for strings 
propagation on a manifold part of which has a negative curvature and a negative 
cosmological constant. 


To conclude, supersymmetric gauge theories have a very rich phase structure and 
many outstanding dynamical issues can be discussed reliably in the supersymmetric 
arena that are hard to address elsewhere. 


8.2.7 Gauging Supersymmetry 


During most of the second part of the twentieth century local symmetries were 
at center stage. The model unifying the electromagnetic and weak interactions 
actually united them mainly by using the concept that gauge theories do describe 
both. The stronger unification using only one non-semi simple group is yet to be 
achieved. The gauge theories have allowed to make precise calculable predictions 
for experimentally measurable quantities. Yet in the last year of that century some 
scientists suggested to move on and accept that gauge symmetries are simply 
redundent descriptions (choosing not to emphasize that this description allows for 
a local description of the theory). Be that as it may, at the time it was natural to 
promote global symmetry into a local symmetry. The result was very rewarding, it 
turned out that the “gauge particle” of local supersymmetry is a masselss fermion 
of the spin 3/2, called the gravitino, whose partner in the same supermultiplet is 
a massless spin two particle, the graviton. Local supersymmetry led to general 
coordinate invariance and the presence of gravity. This is called supergravity 
(SUGRA). 

Lagrangians invariant under this local symmetry were found. Writing them down 
required even more efforts than those needed for the global supersymmetry case. 
Nevertheless this was achieved and a superfield notation was discovered as well. 
The Lagrangians are rather lengthy and we do not display them here. The system 
had additional interesting features. 
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* There is a Higgs like feature for such systems. The massless gravitino became 
massive when spontaneous breaking of supersymmetry occurred. The would-be 
goldstino became part of the massive gravitino. This may resolve the issue of the 
missing Goldstino. 

e SUSY may persist in the presence of a negatively valued cosmological constant. 
Thus the spontaneous breaking of supersymmetry may be fine tuned so as to 
obtain a zero value or very small value for the cosmological constant. This is done 
by balancing the positive vacuum energy resulting from breaking supersymmetry 
against the value of the negative cosmological constant. 

* The presence of gravity renders the Lagrangians to be superficially non- 
renormalizable. In fact in these theories the ultra violet divergences are much 
less severe than what would be expected by power counting. In the presence of 
local SUSY the supersymmetry can be enhanced up to N — 8 in four dimensions. 
Such a theory is intimately related to a ten dimensional theory with N — 1 
supersymmetry in ten dimensions. Some scientists have the hope that this very 
special theory is in fact finite. Time will tell. 

* Once global SUSY is embedded in supergravity one can imagine also terms 
which softly break supersymmetry and do not result from spontaneous breaking, 
i.e. one can add relevant terms to a Lagrangian describing for low energies a 
systems of particles containing superparticles with non-supersymmetric masses 
and interactions. 


8.2.8 The Hierarchy Problem 


SUSY was uncovered on the route of circumventing the no go theorem concerning 
the unification of a internal and space time symmetries into a unified group as well 
as attempting to find a symmetry which relates the different couplings in a rather 
general Lagrangian involving both bosons and fermions. It was taken up again when 
it was decided to declare the very large value of the ratio of the planck scale and 
the weak interaction scale as a problem. On a more technical level the problem was 
stated as follows. Consider a theory with an interacting scalar particle. Notice that to 
this date no fundamental spin zero elementary particles were observed, the discovery 
of an elementary spin zero Higgs particle would change this unfamiliar situation, 
a situation in which the simplest realization of a symmetry is not manifested in 
nature on an elementary level. No matter what the original mass of the scalar 
is, the interactions shift the mass. The mass shift is divergent as is the case for 
renormalizable theories. However according to the renormalization group ideas one 
always encodes the present knowledge valid up to an energy scale A in an effective 
low energy theory. In that case the mass shift is proportional to the cutoff A. As 
long as there is no physical scale near that of the weak interaction's scale of 1 
TeV one needs to fine tune the input initial mass to obtain a Higgs mass of the 
weak interaction scale. This is to be contrasted with the case of a fermion mass. 
In that case the fermion mass is shifted by the interactions by an amount which is 
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proportional only to the logarithem of the cut off and is proportional to the initial 
mass of the fermion. The shift vanishes when the initial fermion mass vanishes. The 
reason behind the vanishing of the mass of the fermion is that for a massless fermion 
the system obtains a new symmetry, chiral symmetry. This symmetry should be 
restored in the zero mass limit. As long as that symmetry is not broken the mass 
remains zero. If the symmetry is dynamically broken the mass shift can be very 
small. In the case of supersymmetrical systems it is the fermionic nature which 
prevails and thus the bosonic mass shifts are small, in a supersymmetric theory 
one may imagine that the hierarchy problem is solved. (The softer divergences 
of the supersymmetric systems are of similar origin as those responsible for the 
no renormalization theorems described before) The problem ab initio is a problem 
which involves in some manner theoretical taste and in any case it seems from the 
LEP data that if one insists on a hierarchy problem it is already present. SUSY has 
not come to the rescue in time and if it is a symmetry of nature, solving the hierarchy 
problem may well not be its main purpose. 

An earlier attempt to solve the hierarchy problem involved the introduction of 
a new gauge symmetry similar in many ways to color called technicolor. This is a 
specific realization of the idea that the Higgs particle in not a fundamental particle. 
Indeed both in describing aspects of superconductivity and superfluidity the Higgs 
scalar is only an effective degree of freedom. As of 2007 these line of ideas were 
not consistent with some of the experimental data. An often quoted problem is that 
such an interaction induces flavor changing neutral interactions at a too high rate. 


8.2.9 Effective Theories 


One may wonder why the Lagrangians describing the basic interactions are 
expressed in terms of a finite number of terms rather then by an infinite one. As 
long as the laws of physics allow the decoupling of far away scales this can be 
explained. The theory is written down in full generality in the presence of a short 
distance/high energy cutoff. The cutoff A may reflect the scale below which one 
has no knowledge on the interactions. The terms are constrained only by possible 
symmetries, their number is a priori infinite. The physics beneath any lower energy 
scale, A' is obtained by integrating out all the degrees of freedom which are heavier 
than A'. A new set of terms replaces the original set, it contains generically less 
terms. This process can be repeated till a theory approaches a critical surface. The 
theory on the critical surface is scale invariant. The resulting theory near the critical 
surface is well described by operators whose scaling dimensions, in d space-time 
dimensions, can be determined near the surface and happen to be smaller and at 
most not much larger than d. In many cases there is only a finite number of such 
operators, i.e., only a finite amount of terms in the Lagrangian describe the physics 
near the critical surface. This not only explains the concise form of the Lagrangian 
but provides one with a systematic method to classify the allowed terms to appear, 
the power of the method is further enhanced when symmetries are present, as those 
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constrain further the allowed terms. This can also sometimes be turned around. The 
collection of marginal and relevant operators may exhibit a symmetry of its own. 
This offers a proof that the symmetry should be present at low energies but may well 
disappear at higher energies if the irrelevant operators do not respect it. The resulting 
Lagrangian is called the effective low energy Lagrangian. It should contain all the 
light particles of the system and all the symmetries, may they be realized linearly or 
nonlinearly, (when the symmetry at question is spontaneously broken). Integrating 
out heavy particles results in a local Lagrangian consisting of a finite number of 
terms. Integrating out light degrees of freedom is likely to lead to non-local effects. 
The terms whose quantum scaling dimensions are smaller than d are called relevant 
terms. They become very large as the system is probed at lower energies and 
disappear at high energies, their number is usually finite and in many cases small. 
Examples of such terms are mass terms for both bosons and fermions in general and 
the non-Abelian Maxwell term for asymptotically free systems such as QCD. The 
terms whose scaling dimensions are exactly d are called marginal operators, they 
include the full Lagrangian for exactly scale invariant systems. Their number is also 
generically finite. They are equally important at all scales. Terms whose dimension 
is larger than d are called irrelevant operators. There is an infinite number of them. 
Each term on its own becomes insignificant in the low energy region and renders the 
theory non-renormalizable for high energies. (One can imagine examples where an 
infinite number of such terms collaborate to become relevant but that would most 
likely mean that the expansion around the critical surface should be modified). 
Those terms which have scaling dimensions not very much larger than d (such 
as five and six in four dimensions), can be useful hints for the scale at which the 
physics needs to be modified. For example the original four Fermi interactions are 
dimension six operators in four space-time dimensions. Their coefficient, the so 
called Fermi coupling, has dimension (—2) and hints at the nearly Tev scale of the 
weak interactions. Indeed at higher energies this irrelevant term is replaced by the 
classically marginal gauge interactions of the standard model. The replacement of 
an irrelevant operator by a relevant (or marginal) one at high energy leads to a well 
defined theory and it called UV completion. The UV completion is not unique, but 
its simplest version seems to do the job in the weak interactions case. This is clearly 
to be done when, like for the weak interactions in the 1950s, the leading term in the 
effective Lagrangian is irrelevant. When the irrelevant term appears in addition to 
marginal and relevant ones it may or may not indicate new physics. 


8.2.10 MSSM Lagrangian 


All this said one can write down the simplest low energy theory that by definition 
contains only marginal and relevant operators. For the case of systems with broken 
SUSY which still do not have a hierarchy problem one may allow in addition 
only those of the above operators which retain the at most logarithmic divergence 
structure of the theory. This requires that the classical marginal operators are all 
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supersymmetric as well as some of the classical dimension three operators. The 
dimension two relevant operators, namely the bosonic mass terms may break SUSY 
as long as the classically marginal terms are supersymmetric. The, so far, simplest 
model imposes minimality. It contains the Standard Model particles and interactions 
and their minimal extensions. Each Standard model particle is accompanied by a 
superpartner. Only one new supersymmetric multiplet is added in which neither the 
bosons nor the fermions are part of the Standard Model, this is a second Higgs 
field. The interactions are minimally extended to be N — 1 superymmetric. The 
system resulting for this construction is called the Minimal Super Symmetric Model 
(MSSM). The number of resulting terms may be minimal but it can hardly be 
considered as small, in fact it has 178 parameters. The superpotential in the MSSM 
contains the following terms: 


f-2uüjÉÜ- X | foie 0? AROS + Gn Of Raa DS + Go È$ Haak). 
i,j=1,3 
(8.120) 


The indices of SU(2); doublets are denoted above as a and b. The (fi)ij are 
the appropriate Yukawa couplings. The Quarks, Leptons and Higgs particles are 
all chosen to be chiral fields and the notation, for one generation, is made explicit in 
Table 8.1. 

The terms above were constructed to be invariant under the standard model 
symmetries, they turn out to conserve also both baryon (B) and lepton (L) numbers. 
However, the terms listed below are marginal and relevant terms which also 
conserve the standard model symmetries but do not conserve either B or L. The 
generic amount of violation induced by these terms is not consistent with the 
experimental data. Thus in MSSM one requires both B and L conservation. By 
the no renormalization theorems the symmetry will be respected to all orders in 
perturbation theory. Such symmetries which are preserved quantum mechanically in 
perturbation theory, once imposed classically, are denoted as natural in a “technical” 
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sense. 
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The B and L symmetries are in any case not respected by non-perturbative effects. 
The above mentioned terms can also be forbidden by imposing a different global 
symmetry called R-parity which was already mentioned in the context of SUSY 
breaking. It is defined here as: 


R = (—) 0-0, (8.123) 


s denoting the spin of the particle. The standard model particles are even 
under the R-parity, while their superpartners are odd under it. In MSSM the 
bosonic partners of the standard model matter fermions carry non-zero L and B 
quantum numbers. The (non)conservation of B and L symmetries are not in general 
correlated. They happen to coincide on the terms disallowed above. For example, 
the terms below both respect R-parity but the first violates L number conservation 
and the second does not respect B number conservation. 


cabl! beca L ÂI (8.124) 
and 
7, 0, D. Êe (8.125) 


For the record we write down all the dimension three and dimension two 
operators which softly break SUSY in the MSSM model. i and j run over the 
generations and are summed over as are the doublet SU(2) indices a and b. 
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The model, as written, has 178 independent parameters. Their number could 
be reduced if they had their origin in an underlying microscopic theory. In some 
present versions the matrices of parameters denoted above by cj; are set to zero 
thus reducing by fiat the number of free parameters to 124. This is but the tip of an 
iceberg, one proceeds from versions of the MSSM and derives the mass spectrum of 
the supersymmetric particles while ensuring to preserve the known properties of the 
standard model particles as well as a variety of experimental bounds. The bounds 
range from cosmological ones to bounds on rare decays. We hope that in the not 
too distant future the fog will disperse and one will be able to write a rather concise 
item in a physics encyclopedia which would select the relevant physical components 
of this effort. As mentioned any model for SUSY breaking should respect present 
experimental constraints. Many models of spontaneous SUSY breaking fail to do 
this as a result it was suggested to add to the yet to be seen superpartners of the 
standard model particles also a hidden sector. In such models SUSY is to be broken 
in the hidden sector and its effects are supposed to be mediated to the "seen" sector. 
The agents or messengers which couple the two sectors vary. In some models the 
coupling is by the gravitational force at tree level, in others the coupling occurs first 
only at the one loop level and is called anomaly (Weyl anomaly) mediated. There are 
models in which the coupling is done through gauge interactions, they have a lower 
energy scale than the gravity mediated interactions. Each of these models has some 
advantages as well as disadvantages and are at this stage an active area of research. 

We will end the section with a short glossary of terms currently used in describing 
MSSM features which were not mentioned above. 

Short Glossary: 


e Chargino— Charged supersymmetric partner of a charged standard model parti- 
cle. 

* Gaugino—Spin one half supersymmetric partner of a standard model gauge 
particle. 

* LSP—Lightest supersymmetric partner of a standard model particle. It is long 
lived in many models. 

* u term—Term coupling the two different Higgs chiral supermultiplets. 

* Neutralino—Neutral supersymmetric partner of a neutral standard model parti- 
cle. 

e squark—Spin zero supersymmetric partner of a standard model quark. 

e slepton—Spin zero supersymmetric partner of a standard model lepton. 

* tan(f)—Ratio of the expectation values of the two Higgs fields. 
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8.3 Unification 


8.3.1 Gauge Group Unification [2] 


The standard model is unified along the lines that all of its components, the 
colour, weak and electromagnetic interactions are described by gauge theories. 
They are SU(3) , SU(2) » UC). The gluons and the photons, the electrons and 
the quarks belong to different representations of the product gauge group, it is 
natural to attempt to have all of the elementary particles and interactions as a 
single representation of a single gauge group. Intermediate algebraic solutions to 
this problem have been found. For example the gauge group SU(5) does very 
economically unify the known gauge groups. Moreover it predicts that quarks 
and leptons can transmute into each other, in particular violating Baryon number 
conservation. The proton could decay in such models for example into an positron 
and a neutral pion. The original estimates of the half life time of the proton placed 
it possibly around 10?! years. That prediction was within experimental reached and 
initiated the construction of very ingenous experiments. Proton decay at that rate 
was not found, the lower bound on the proton life time was improved to be 1035, thus 
invalidating the simplest version of the grand unified group SU(5). In that version 
the particles did not all belong to the same representation as one may have wished 
based on esthetics. There have been many other attempts since to find an appropriate 
unifying group these included SO(10) (which incorporated naturally a right handed 
SU(2) singlet neutrino) and exceptional groups such as E(6). This was done with and 
without SUSY. Using the renormalization group it was found that in the presence of 
SUSY the in teractions may indeed unify in magnitude at a high energy not much 
below the planck scale of 10!? GeV. At such scales it becomes difficult to ignore 
quantum gravity. In any case we will not review this vast subject further here. 


8.3.2 Extra Dimensions and Unification [3] 


The first ideas of unification by increasing the number of dimensions were suggested 
in classical field theory by Kaluza and Klein (KK) in the period of the 1920s. At 
the time there were two well known interactions, gravity and electromagnetism. 
KK suggested that space time is five dimensional rather than four dimensional and 
that there is but one fundamental interaction-gravity. In order to be consistent with 
observations it was suggested that the five dimensional space time is composed out 
of a fifth dimension which is a spatial circle of inverse radius m, and the usual 
four dimensional Minkowski component. The radius 1/m should be small enough to 
have not been observed yet. The resulting low energy (i.e. energies much lower 
than m) five dimensional Lagrangian decomposes into several four dimensional 
Lagrangians. They describe, four dimensional gravity, four dimensional Maxwell 
electrodynamics and the coupling of a neutral spin zero additional particle. The 
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symmetry group of the low energy Lagrangian consists of four dimensional general 
coordinate invariance as well as a U(1) four dimensional gauge symmetry. This 
result is obtained in the following manner [3]. 

The basic five dimensional gravitational action is thus given by: 


^ 1 A 
$=, | £5 (8.127) 
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With ê? being the five dimensional Newton constant. The action $ is invariant 
under the fivedimensional general coordinate transformations 


0055 = Ink Boo + OE” Baa + P0585; (8.128) 


A useful 4 + 1 dimensional ansatz was: 
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The fields depend on the five dimensional coordinate £^. which were written as: 
RH = (x, y), u = 0,1,2,3, and all unhatted quantities are four-dimensional. The 
fields g,,, (X), Ay (x) and q(x) are the spin 2 graviton, the spin 1 photon and the spin 
0 dilaton respectively. The fields gu» 05, y), Aj G5 y) and (x,y) may be expanded in 
the form 


n—oo ; 
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Recall that m is the inverse radius in the fifth dimension. 

In general the five dimensional theory can be described in terms of an infinite 
number of four dimensional fields. It also has an infinite number of four-dimensional 
symmetries appearing the Fourier expansion of the five dimensional general coordi- 
nate parameter E Â (x, y) 


Enay E Ee, 
" mde l (8.131) 
Et (x, y) = 2 E^ (xe. 


However, at energy scales much smaller than m only the n — 0 modes in the 
above sums enter the low effective low energy action. 
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The n = 0 modes in (8.130) are just the four dimensional graviton, photon and 
dilaton. Substituting (8.129) and (8.130) in the action (8.127), integrating over y and 
retaining just the n = 0 terms one obtains (dropping the 0 subscripts) 


1 1 1 
$-1 dx Zg |^ - 5h09" à — ze nur] (8.132) 


where 2ztk? = mk^? and Fuy = 04Ay — vAu. From (8.128), this action is invariant 
under general. 

coordinate transformations with parameter £o, i.e. (again dropping the 0 
subscripts) 
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local gauge transformations with parameter £^o 
8A, = 0,£* (8.134) 


and global scale transformations with parameter À 


8A, = AAy, 89 = —22/V3 (8.135) 
The symmetry of a vacuum, determined by the VEVs 
< Suv >= Nw, < Ap >= 0, < Y >= G0 (8.136) 


is the four-dimensional Poincare group xR. These expectation values have not been 
determined dynamically but let us settle here for them. At this level of the analysis 
the masslessness of the graviton is due to unbroken four dimensional general 
covariance, the masslessness of the photon is consistent with the four dimensional 
gauge invariance and the dilaton seems massless because it is the Goldstone boson 
associated with the spontaneous breakdown of the global scale invariance. In fact 
taking into account the actual periodicity in y which is not manifested for only the 
n = 0 low energy modes the symmetry acting on the scalar field is U(1) rather than 
R and is thus not a true scale symmetry. The field qo is a pseudo-Goldstone boson 
and does not really deserve to called a dilaton. Further analysis uncovers an infinite 
tower of charged, massive spin 2 particles with charges e, and masses m, given by 


en 2 nN2km, m, =| n | m (8.137) 
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Thus the KK ideas provide an explanation of the quantization of electric charge. 
(Note also that charge conjugation is the parity tranformation y — —y in the fifth 
dimesion.) If one indeed identifies the fundamental unit of charge e = /2km with 
the charge on the electron, then one is forced to take m to be very large: the Planck 
mass 10? GeV. Such extra scalar particles seem to be present abundantly in string 
theories as well. The experimental question was mainly to set bounds on the value 
of the radius R(= 1/m) of the fifth dimension. The limits are obtained from precision 
electroweak experiments and require m > 7TeV. This is for models with one extra 
dimension, in which the gauge bosons propagate in the bulk but the fermions and 
Higgs are confined to four dimensions. More on this shortly. There are also bounds 
from astrophysics they are not model independent bounds. They are important for 
large dimensions in which only the graviton propagates, and they depend on number 
of large dimensions. The strongest bounds arise from supernova emissions, giving 
l/m < 107^ mm for the case of two large extra dimensions. Although researchers 
such as Einstein and Pauli had studied such theories for dozens of years this path has 
been abandoned. Extra dimensions resurfaced when string theory was formulated. 
It was found out that strings are much fussier than particles, (super) strings can 
propagate quantum mechanically only in a limited number of dimensions. In fact, 
not only was the number of allowed possible dimensions dramatically reduced, the 
allowed values of the number of space-time dimensions did not include the value 
4. These dimensions are usually required to be 10 or 26 (The origin of this basic 
number is, at this stage, disappointingly technical.) In fact while the numbers 10 
and 26 result from the theory they need not always be related directly to extra 
dimensions. When the extra dimensions emerged in string theory it was suggested 
in the spirit of KK that any extra dimensions are very small. As interest was diverted 
from string theory back to field theory the ideas of KK were revised allowing one 
to take into account the extra interactions discovered since the original work of 
KK. It turned out that if one which to trade all the known standard model gauge 
interactions, i.e. the all SU(3)  SU(2) .. UC) interactions, for a theory of gravity 
alone one was led to consider eleven dimensional gravity. Such theories raised 
interest also for other reasons. The return of string theory brought back with it an 
intense study of extra dimensions. In particular extra dimensions in the form of 
compact Calabi-Yau manifolds which allowed to have four dimensional effective 
theories with N — 1 SUSY. The topology of the extra dimensions determined in 
some cases the number of zero modes on them, that is the spectrum of massless 
particles. The number of generations of particles for example could be correlated 
thus to the topology of those extra dimensions offering a solution to the origin of 
the repetitive structure of the elementary particles. A vast amount of research on the 
possible extra dimensions is ongoing. There have also been attempts to understand 
dynamically the origin of the difference between the four large extra dimensions and 
the rest. Some efforts were directed to explain how a spontaneous breaking of space 
time symmetries could lead to such asymmetry in the properties of the different 
dimensions. Solid state systems such as liquid crystals also exhibit such differences. 
Other efforts focused on determining why an expanding universe would expand 
asymmetrically after a while only in four directions. Another development occurred 
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following the realization that string theory in particular allows what are called brane 
configurations. Branes are solitons, extended objects embedded stably in a space 
time of larger dimension. Vortices and magnetic monopoles were mentioned as 
such lower dimensional objects. There is a consistent possibility that for example 
a four dimensional universe can be embedded in higher dimensions. The known 
gauge interactions living only on the brane while gravity extends to the full space. 
If our universe has this structure many things can be explained and in particular 
this has given rise to the possibility the extra dimensions could be much larger than 
previously expected. Actually they could extend up to the submicron region. This 
could lead to measurable deviations from Newton's gravity at those distance scales. 
Astrophysics gives upper bounds on how large such extra dimensions may be but in 
any case this is a very significant relaxation of a bound, in fact I am not aware of 
any bound on such a fundamental quantity in physics that has been altered to such 
an extent by a theoretical idea. All this said one should stress the obvious, also the 
bounds have been relaxed the true value of the extra dimensions may still be very 
small, perhaps even Planckian. Some suggestions of larger extra dimensions could 
well be tested at the LHC. 

The fact that unification may occur not far from the planck scale brings quantum 
gravity to the front row and with it a theory which attempts to be able to indeed tame 
quantum gravity that is string theory. 


8.4 String Theory [4] 


8.4.1 No NOH Principle 


String Theory is at this time far from being a complete theoretical framework, not 
to mention a phenomenological theory. That said, the theory of extended objects 
has evolved significantly and has shaped and was shaped by aspects of modern 
mathematics. In my opinion is it appropriate to highlight the qualitative aspects of 
string theory and the ideas behind it. This choice is not made for lack of formulas 
or precision in string theory. An essential catalyst in the process of the formation of 
string theory was urgency, the urgency created out of the near despair to understand 
the amazing novel features of the hadronic interactions as they unfolded in the 
1960's. String theory was revitalized in the latter part of the twentieth century by 
the urgency to understand together all of the four known basic interactions. Here 
I shall briefly review some of the motivation to study a theory of extended objects 
and discuss the challenges string theory faces, the successes it has had, and the 
magic spell it casts. We start by reviewing a hardware issue, the hard wiring of some 
scientific minds. Researchers using string theory are faithful followers of an ancient 
practice, that of ignoring the NOH principle. The NOH principle is very generic, it 
states that Nobody Owes Humanity neither a concise one page description of all the 
basic forces that govern nature nor a concise description of all the constituents of 
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matter. Actually no reductionist type description is owed. Time and again physicists 
driven by their hard wiring processed the experimental data utilizing theoretical 
frameworks and were able to come up with explicit formulae; formulae that were 
able to express on less than one page the essence of a basic force. A key assumption 
used and vindicated was that the basic constituents of nature are point-like. The 
electromagnetic forces, the weak interactions and even the powerful color forces 
were rounded up one by one, straddled by the rules of quantum mechanics and 
then exhibited on a fraction of a page. Using these formulae, an accomplished 
student abandoned on an isolated island can predict with an amazing accuracy the 
outcome of some very important experiments involving the basic forces. The level 
of accuracy ranges from a few percent in the case of some aspects of the color forces 
to better than 107? for some features of the electromagnetic interactions. 


8.4.2 Why Change a Winning Team: Extended Constituents 
Are Called Upon to Replace Point-like Ones 


Point particles have done a tremendous job in shouldering the standard model of 
particle interactions. Why replace them by extended objects? In the section on 
SUSY we have outlined several of the reasons for that. The standard model does 
reflect the enormous progress made in the understanding of particle physics but it 
is not perfect, at least as seen by a critical theorist through his TPs. The model is 
afflicted with tens of parameters as well as what are termed naturality problems. 
The values of some physical quantities, such as the mass of the Higgs particle, 
are required to be much lighter than the theory would naturally suggest. Ignoring 
the NOH principle, scientists find themselves once again on the path of searching 
for a more concise description. One effort was directed towards unifying the three 
interactions, thus following the conceptual unification of the electromagnetic and 
weak interactions, described each by a gauge theory. That direction led to possible 
unifications at scales not so distant from where quantum gravity effects are definitely 
supposed to become important. In fact, gravity, the first of the basic forces that was 
expressed by a mathematical formula, is the remaining unbridled known force which 
is considered currently as basic. Here the quantum breaking methods based on the 
concept of a basic point like structure have failed. Sometimes what is considered 
failure is also a measure of intellectual restlessness, still it made sense to search 
for alternatives. Actually even with no apparent failure lurking upon basic physics, 
a study of a theory of fundamentally extended objects is called for. After all, why 
should the basic constituents be just point like? String theory is a natural extension 
of the idea that the basic constituents are point-like, it is an investigation into 
the possibility that basic constituents are ab initio extended objects. The simplest 
such extended objects being one-dimensional, that is strings. In a sense one may 
regard also point particles as extended objects dressed by their interactions. The 
direct consequences in this case are however totally different. It turned out that 
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such a generalization did eventually reproduce many properties of point particle 
interactions and in addition enabled one to formulate what seems as a theory of 
gravity. This it should do, following the tradition of subjecting any new theory to the 
test of the correspondence principle. The theory is well defined up to several orders 
and perhaps to all orders in perturbation theory. The perturbations are small at the 
vicinity of a very small distance scale called the string scale, as small as that distance 
is, it is sometimes expected to be larger than the Planck length and thus is associated 
with lower energies than the Planck energy (it is related to the plank scale by the 
string interaction coupling). Interactions among strings are, to a large extent, softer 
and fuzzier than those among point particles. Recall that a large number of successes 
of particle physics consisted of explaining and predicting the physics of the various 
interactions at scales at which they were weakly coupled, otherwise, a large dose 
of symmetry was essential. This is perhaps the most significant achievement of 
string theory. It is also its major source of frustration, if the string scale is indeed 
of the order of 10? cm or even slightly larger, it is not known today how to 
verify, experimentally, any prediction resulting from perturbative string theory. It 
is very difficult to imagine, for example, how to set up an experiment measuring the 
differential cross section of graviton-graviton scattering at the string scale. We will 
return to this extremely important issue later on and continue to follow for a while 
the path of the theoreticians. 


6.4.3 New Questions 


The impact of scientific progress can be tested by the type of questions it makes 
us aware of, as well as by the answers it eventually provides for them. By studying 
string theory new questions do arise. Values taken for granted are subjected to query 
and downgraded to being parameters. String theory suggests questioning the values 
of some such physical quantities and in some cases offers scientific answers to the 
question. A prime example is that of the number of space-time dimensions. In a 
point like constituent theory there is a very large degree of theoretical freedom. As 
far as we know spin zero point particles, for example, could have propagated in 
any number of dimensions, their interactions would have been different depending 
on the dimension of space-time but nevertheless they would have been allowed. 
A theory of spin one half particles can turn out to be in consistent on certain 
type of manifolds but they do not restrict the dimensionality of the space in 
which they propagate. Strings are much fussier than particles, (super) strings can 
propagate quantum mechanically only in a limited number of dimensions. In fact, 
not only are the number of allowed possible dimensions dramatically reduced, 
the allowed values of the number of space-time dimensions do not include the 
value 4. These dimensions are usually required to be 10 or 26 depending on the 
amount of supersymmetry the string is endowed with. The origin of this basic 
number is, at this stage, disappointingly technical and one should note that though, 
strictly speaking, the number 26 does indeed appear in string theory, it is not 
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always possible to associate this number with the number of dimensions. In fact 
the numbers appearing are 15 and 26 and they reflect from one point of view a 
conformal anomaly in the theory describing first quantized string theory. In first 
quantized field theory of particles can be expressed as a sum over one dimensional 
field theories, i.e. quantum mechanics, of a particle moving in space time and 
interacting along its space time trajectory. The action of the quantum mechanical 
system is geometrical and describes in its simplest form the length of the particle's 
trajectory. The result should not depend on the parameterization of trajectory and 
this should be general coordinate invariant. In a more fancy manner the action 
describes a particle coupled to gravity evolving in one space time dimension. The 
generalization to a description of the motion of a one dimensional object, a string, 
follows. As the string is a one dimensional object, its motion in space time spans 
a two dimensional manifold. The theory describing its motion is that of a particle, 
coupled to gravity, moving in two space-time dimensions. The two dimensions are 
called the world sheet and the theory describing them using first quantization is 
called the World Sheet theory. The usual counting of quantum degrees of freedom 
leads to the conclusion that gravity has minus one degrees of freedom in both one 
and two dimensions. In other words the system is over constrained and will require 
a higher degree of symmetry to be consistent. In addition the two dimensional 
gravitational system has an anomaly which restricts the algebraic structure of the 
conformal system describes by the string. A motion of a (super) string in (10) 26 
flat dimensions removes the anomaly. There are many other ways to remove it and 
perhaps many more to be discovered. Sometimes the background on which the string 
can propagate has no obvious geometrical description, just an algebraic one. The 
space in which the strings moves is called the Target Space. The interactions of 
strings are described by them parting or joining, this is described (in a Euclidean 
formulation) by a compact two dimensional surface with non-trivial topology. These 
manifolds are called Riemman manifolds and are distinct by their topology. The 
freely propagating string is described by a world sheet 2 sphere, the first interactions 
occur when the world sheet is a torus. The theory is defined perturbatively by 
summing with a certain weight over all such two dimensional Riemann surfaces. 
The non-perturbative structure is non-trivial but less understood at this stage. But 
setting that aside, one would like to detect extra dimensions experimentally. As the 
idea of having extra dimensions was raised originally in field theory, upper bounds 
on the length of such dimensions were already available, more recently under the 
security net of string theory these bounds have been revised (they have been relaxed 
by more than 10 orders of magnitude and brought up to the sub micron regime), 
in fact I am not aware of any bound on a fundamental quantity in physics that 
has been altered to such an extent by a theoretical idea. This fact about strings 
illustrates the tension inherent in attempts to search for experimental verification 
of extremely basic but very weakly coupled phenomena. Sitting in our chairs we 
sometimes forget how frail gravity is. It is all the planet earth that gravitationally 
pulls us towards its center, yet all this planetary effort is easily counterbalanced 
by the electromagnetic forces applied by the few tiles beneath our chair. Although 
string theory sets constraints on the possible number of dimensions, that is not to 
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say that point particles allow everything. We mentioned that spin one half particles 
can't be defined on all manifolds but the number of known restrictions is much 
smaller. The interactions could be of an infinite (sometimes classifiable) variety, the 
color group, which happens to be SU(3), could have been for all the theory cares, 
SU(641), the electromagnetic force could have been absent or there could have been 
17 photons. Not everything is allowed, but an infinite amount of variations could 
have been realized instead of what one actually sees in nature today. The same goes 
for many (but not all) of the properties of the space-time arena in which all the 
interactions are occurring, in particular effective low energy theories (and one seems 
to be constrained to use only such theories for a long time to come) contain many 
more free parameters such as masses and some of the couplings. In a string theory 
one may have expected that all or most of these parameters are fixed in some manner. 
Actually is it more complicated, for some string backgrounds, some parameters are 
fixed, while other parameters are not. In addition there seemed to be several string 
theories leading to even more possibilities. 


6.4.4 The One and Only? 


As we have discussed the desire to find the one and only theory encompassing all 
fundamental physical phenomena seems hard wired in many scientific minds. Is 
string theory an example of such a theory? Before addressing this question let us 
reflect upon the limitations of the methods used. What would in fact satisfy the 
purest (or extremist) of reductionists? A one-symbol equation? Be her desires, what 
they may be, one should recall that one is using mathematics as our tether into the 
unknown. This tool, which has served science so well, has its own severe limitations, 
even ignoring the issue of using differential equations as tool to probe short distance, 
a generic problem in mathematics can't be solved, maybe the NOH principle will 
eventually have its day, one will run out of interesting questions which have answers, 
maybe that day is today! (Although some mathematicians suggest to use physicists 
as hound dogs that will sniff one's way towards interesting and solvable problems). 
Having this in mind the key question is posed in hope it does have an answer: find the 
unique theory describing the fundamental forces. There have been ups and downs 
for the proponents of the one and only string theory. The understanding of the 
situation has passed several evolutionary stages. From the start it was recognized 
that one may need to settle for more than only one string theory. It was thought 
that a distinction could been drawn between a theory of only closed strings and a 
theory containing both open and closed strings. A theory with only open strings 
was realized to be perturbatively inconsistent. Even those theories needed not only 
a fixed dimension but in addition an extra symmetry, supersymmetry, to keep them 
consistent. It was very quickly realized that actually there are an infinite varieties of 
backgrounds in which a string could move. For example, consider 26 dimensional 
spaces in which one dimension is actually a circle of radius R. It turned out that all 
possible values for the radius, R, of the circle are allowed. It was then realized that 
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Fig. 8.7 The potential, familiar from the standard model, has a shape of a sombrero. In the figure, 
its minima lie on a circle, each point along the circle, can be a basis for a ground state. The physics 
around each is equivalent. In string theory and in supersymmetric systems flat potential arise 


Fig.8.8 The potential has two orthogonal flat directions. The landscape of the valleys is 
generically more elaborate. Each point along the flat directions can serve as a basis of a ground 
state. In many cases. The physics around each ground state is different 


there is a large variety of 22 dimensional compact manifolds in the case of bosonic 
strings (6 dimensional compact dimensions in the case of the supersymmetry), 
which could accompany the four dimensional Minkowski space-time one is familiar 
with. Each such compact manifold was called a string compactification. Next it 
was suggested that actually all possible compactifications are nothing but different 
solutions/ground states of a single string theory. Each of the solutions differing from 
each other by the detailed values of the physical parameters it leads to. But the 
ground states have also many common features, such as generically the same low 
energy gauge group. To appreciate this consider the difference between the potential 
of the Higgs particles which may describe the mass generation of the carriers of the 
weak interactions and the effective potential leading to the ground states in string 
theory. For the electro-weak interaction case the potential has the form of a Mexican 
hat, the ground state may be described by any point in the valley (Fig. 8.7) but all 
choices are equivalent, they describe exactly the same physics. 

In the case at hand the valley of minima is flat and extends all the way to infinity. 
Any point along the valley can be chosen as a ground state as they all have the 
same energy and yet each describes different physics. The manifold describing the 
infinity of these degenerate ground states is called the moduli space when they are 
continuously connected (Fig. 8.8). 
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These connected regions of valleys trace out very elaborate geography and 
describe different possible solutions of string theory. All known stable ones share in 
common the property of describing a supersymmetric theory. In such theories each 
bosonic particle has a fermionic particle companion. It then turned out that there 
actually are several different types of string theories, each having its own infinite set 
of degenerate ground states. What made the string theories different were details, 
most of which are rather technical, which seemed to affect the particle content and 
gauge symmetries of the low energy physics. There was a stage, in the eighties of 
the last century, at which some researchers imagined they were on the verge of being 
able to perform reliable perturbative calculations of a realistic theory containing all 
the aspects of the standard model. That turned out not to be the case. However, in 
the process, the geography of many interesting ground states was surveyed. Next, a 
large amount of circumstantial evidence started to accumulate hinting that all these 
theories are after all not that distinct and may actually be mapped into each other 
by a web of surprising dualities. This led once again to the conjecture that there is 
but one string theory, this time, the theory that was supposed to encompass all string 
theories and bind them, was given a special name, M theory. Even in this framework 
all string theories brought as dowries their infinite moduli space. M theory still had 
an infinite number of connected different degenerate supersymmetric ground states, 
they were elaborated and greatly enriched by what are called brane configurations. 
Branes are a special type of solitons that appear in string theory. They come in 
various forms and span different dimensions. What is common to several of them, 
is that open strings may end on them. 

Thus what was thought to be a theory of only closed strings contains open strings 
excitations in each of its sectors that contain branes. Moreover, every allowed brane 
configuration leads to its own low energy physics. In fact there are suggestions that 
our four dimensional experience results from us living on one or several branes 
all embedded in a larger brane configuration placed in a higher dimensional space- 
time. How does that fit with the expectations to have the One and Only string theory? 
Well, in this framework there is one theory but an infinite amount of possible ground 
states. Are we owed only once such state? There are those who hope that the vacua 
degeneracy will be lifted by some yet to be discovered mechanism leaving the one 
unique ground state. Others are loosing their patience and claiming that the NOH 
principle takes over at this stage, the theory will retain its large number of ground 
states. 

Moreover a crucial ingredient of string theory at present is super symmetry 
which was discussed obove, a beautiful symmetry not yet detected in nature, and 
even if detected, it seems at this stage to play only a role of a broken symmetry. 
It is not easy to obtain even remotely realistic models of string theory in which 
supersymmetry is broken (that is the symmetry between fermions and bosons is 
broken), a nearly vanishing cosmological constant and light particles whose mass 
values are amendable to a calculation within a reliable approximation. Candidates 
for such ground states have been suggested recently, these particular candidates 
consist of very many isolated solutions living off the shore of the supersymmetric 
connected valleys. Many of these solutions actually have different values for the 
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vacuum energy and perhaps may thus decay from one to the other. In particular 
bubbles may form within one configuration, bubbles that contain in them a lower 
energy configuration. Some researchers would like to see the universe itself, or 
the bubbles that form in it, as eternal tourists who will end up visiting many of 
these metastable states. Some suspect that many of these metastable states are 
stable. Some would encourage brave sailors to take to sea again, as they did in 
the 1980s, and charter this unknown geography, some are trying to make this 
endeavor quantitative by counting vacua which have a given property, such as a 
given cosmological constant. This subject is still at its infancy, many hopes and 
opinions are currently expressed, it would be nice if this issue will turn out to fall 
within the realm of solvable problems. 

The challenges to string theory on the experimental side have been to reproduce 
the standard model. This has not been done to this day. Many new ideas and 
insights have been added. Bounds have been shifted several times, one has felt 
being very near to obtaining the desired model, each new approach seemed to bring 
one nearer to that goal, but obtaining one completely worked-out example, which 
is the standard model, remains a challenge for string theorists. Another important 
issue is that of the value of the cosmological constant. It is usually stated that the 
small value of the cosmological constant is a major problem. These arguments are 
centered around some version of a low energy effective field theory. That theory 
could be valid below a few tens of electron volt thus describing only aspects of 
electromagnetism or it could be at a TeV scale describing both the electroweak 
interactions. The arguments goes on to say that the vacuum energy of such an 
effective theory should be, on dimensional arguments grounds, proportional to the 
scale of the cutoff, (the cutoff is set at that energy beyond which the ingredients of 
the physics start to change) that is evs or TeVs in the above examples. In either 
case, the value of this vacuum energy, which is the source of the cosmological 
constant, is larger by an astonishing number than the known bounds on the value 
of the cosmological constant. The argument is not correct as stated, it is not only the 
cutoff that contributes to the cosmological constant, all scales do actually contribute 
to the vacuum energy and if a cutoff should be invoked, it is the highest one, such as 
the Planck scale, that should be chosen. Moreover, this argument does not take into 
account the possibility that the fundamental theory has a certain symmetry which 
could be the guardian of vanishingly small value of the cosmological constant. Such 
a symmetry exists, it is called scale invariance, and it has shown to be a guardian 
in several settings. Scale invariance is associated with the absence of a scale in the 
theory. There are many classical systems which have this symmetry and there are 
also quantum systems which retain it. In a finite, scale invariant theory not only 
does the vacuum energy get contributions from all scales, they actually add up 
to give rise to a vanishing vacuum energy. This remains the case even when the 
symmetry is spontaneously broken. Being a consequence of a symmetry, the vacuum 
energy contribution to the cosmological constant vanishes also is the appropriate 
effective theory. What does string theory have to say about that? String theory 
could well be a finite theory, it is not scale invariant, as it contains a string scale, 
if that scale would be spontaneously generated perhaps one would be nearer to 
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understanding the value of the cosmological constant. If that scale is indeed not 
elementary, one would be searching for the stuff the strings themselves are made 
off. Such searches originating from other motivations are underway, some go by the 
name of Matrix Model. Particle Physics as we know it is described by the standard 
model evolving in an expanding universe. The methods used to study string theory 
are best developed for supersymmetric strings, and for strings propagating on time 
independent background, nature is explicitly neither. What then are the successes of 
string theory? 


6.4.5 Successes: Black Holes, Holography and All That ... 


A major success described above was to obtain a theory of gravity well defined 
to several orders in perturbation theory, in addition, no obvious danger signals 
were detected as far as the higher order in perturbation theory are concerned, non 
perturbative effects are yet to be definitely understood. This issue involves the short 
distance structure of string theory. String theory is supposed to be a consistent 
completion of General Relativity (GR). General relativity suffers from several 
problems at low energies. String theory, which according to a correspondence 
principle, is supposed to reproduce GR in some long distance limit, will thus have 
inherited in this merger, all the debts and problems of GR. If it does solve them it will 
need to do it with a "twist" offering a different point of view. That perspective could 
have been adopted in GR but was not. This has occurred in several circumstances, 
one outstanding problem in GR is to deal with the singularities that classical gravity 
is known to have. These include black holes, big bangs and big crunches as well as 
other types of singularities. String theory can offer a new perspective on several of 
these issues. Let us start with black holes. The first mystery of black holes is that 
they seem to posses thermodynamical properties such as temperature and entropy. 
This is true for charged black holes as well as for uncharged Schwarzschild ones. 
In the presence and under the protection of a very large degree of supersymmetry, 
string theory tools enabled to provide a detailed microscopical accounting of the 
entropy of some black holes. More precisely it was shown in these special cases 
that the number of states of a black hole is identical to the number of states of 
essentially a gauge theory. It was possible to count the microscopic number in the 
gauge theory case and the resulting number of states was exactly that predicted 
for black holes! These are mostly higher dimensional very cold black holes. This 
has not yet been fully accomplished for uncharged black holes. Black holes have 
several more non-conventional properties. The above mentioned entropy of a mass 
M black hole is much smaller, for many types of black holes, than that of a non 
gravitating system of particles which have the same energy, M. In fact one may 
suspect that such black holes dominate by far the high energy spectrum of any 
gravitating system. The Schwarzschild radius of these black holes actually increases 
with their mass. In general one associates large energies with short distances, in the 
case of black holes large energies are related also to large distances. A meticulous 
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separation of long and short distances is a corner stone of the renormalization group 
approach, this is not evident in the case of gravity. One possible implication of this 
behavior is that in gravitational systems the amount of information stored in a spatial 
volume is proportional to the boundary's surface area rather then to the bulk volume. 
String theory provides explicit and concrete realizations of this amazing behavior. 
A system of strings moving in particular ten dimensional space-times (of the anti 
de Sitter type) are equivalent to special highly symmetric supersymmetric non- 
Abelian gauge theories living in four dimensions and experiencing no gravitational 
forces. Moreover modifying the backgrounds on which the strings propagate by 
adding defects to them, one finds that strings propagating on certain backgrounds are 
equivalent to gauge theories in which color is confined. String theory methods are 
used to perform difficult calculations in hadronic physics. String theory, which was 
born out of hadronic physics, has returned to pay back its original debt! In addition, 
remarkable structures have been uncovered in supersymmetric gauge theories using 
string theory methods. Attempts have been made to confront strings with other 
singularities. Why should strings respond to singularities in any other way than 
particles do? A Talmudic story helps explain the manner in which extended objects 
modify problems associated with point particles. During a seminar, the following 
question came up: Assume a pigeon (a very valuable animal at the time) is found 
somewhere, to whom does it belong? The rule is, finders-keepers, as long as the 
pigeon is found further away, than some determined distance, from the entrance to 
an owned pigeon-hole. A student raised his hand and asked: “The pigeon is, after all, 
an extended object, what if one of its legs is nearer than the prescribed distance to 
the pigeon-hole but the other leg is further away?" The student was ejected from 
his place of study for raising the question. Strings definitely turn out to soften 
some harsh singularities which are time independent (called time like singularities). 
Strings are also found to punch through singularities which form at particular times 
such as a big crunch. The analysis of such systems is still at very early stages, 
still I would like to note a very interesting feature in common up the present to all 
attempts to study using string theory methods universes that are spatially closed. In 
these cases the compact universe is found to be immersed in one way or another in 
a non compact universe. The study of strings near these singularities could well be 
essential to appreciate their properties at short distances. A key ingredient driving 
some researchers is a new panoramic vista opening up to them. For some it is the 
precise calculations of experimentally measurable quantities, for others it is the 
Magic. 


6.4.6 Magic 


The universe as viewed with string probes is full of magic ambiguities and 
symmetries unheard of before. In fact most mathematical concepts used to describe 
the universe are veiled under symmetry. There are cases for which each of the 
following concepts becomes ambiguous: distances, the number of dimensions, 
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topology, singularity structure, the property of being commutative or not being 
commutative. Let us somewhat elaborate on the magic. 


* Distance—The simplest example is that of a universe in which one of the 
dimensions is extended along identified with a circle of radius R. It turns out 
that an experimentalist using strings as his probes will not be able to determine 
if the universe is indeed best described but one of it dimensions being a circle of 
radius R (in the appropriate string scale) or by being a circle of radius 1/R. For 
point particles moving a circle of radius R, the energy spectrum has large gaps, 
when R is small, and very small gaps, when the value of R is large. For strings, 
which are extended objects, the spectrum consists of an additional part, that of 
the strings wrapping around the circle. A point particle can’t wrap around the 
circle. This part of the spectrum is narrowly gapped for small values of R and 
widely gapped for large values of R, this is because the energy required to wrap a 
string, which has its tension, around the circle, is proportional to the length of the 
wrapped string. For an experimentalist, who can use point particles as probes, the 
distinction is clear, not so for one using string probes. This is but the tip of the 
iceberg of an infinite set of ambiguities. Some geometries, judged to be different 
by point probes, are thus identified by extended probes. 

* Dimensions—One could at least expect that the value describing the number of 
spacetime dimensions to be unique. It turns out not to be the case. Ambiguities in 
calling the “correct” number of space-time dimensions come in several varieties. 
In string theory there are examples of a string moving in certain ten dimensional 
space-times which measures exactly the same physics as that measured by a 
certain point particle gauge theory in four dimensions. Each system is made out 
of totally different elementary constituents, one system is defined to exist in ten 
dimensions, the other in four and yet both reproduce the same observables, the 
same physics. Another example consists of a strongly coupled string theory in ten 
dimensions whose low energy physics is reproduced by an eleven dimensional 
supergravity theory. This is true for large systems. For small, string scale, 
systems, more magic is manifested. A string in some cases can't distinguish if it 
is moving on a three dimensional sphere or a one dimensional circle. So much 
for non-ambiguous dimensions. 

e Topology—Objects that can be deformed to each other without using excessive 
violence (such as tearing them) are considered to have the same topology. The 
surface of a perfect sphere is the same, topologically, as that of the surface of 
a squashed sphere, but different, topologically, than that of a bagel/torus. Well, 
once again that is always true only as far as point particles probes are concerned. 
In string theory there are examples in which the string probes can describe the 
same set of possible observations as reflecting motions on objects with different 
topologies. In addition in string theory there are cases were one may smoothly 
connect two objects of different topology in defiance of the definition given above 
for different topologies. Topology can thus be ambiguous. 

* Singularities—A crucial problem of General Relativity is the existence of singu- 
larities. Bohr has shown how such singularities are resolved in electrodynamics 
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by quantum mechanics. In GR one is familiar with singularities which are 
stationary (time-like) such as the singularity of a charged black hole and those 
which form or disappear at a given instance instant such as the big bang and 
the big crunch (space-like). What happens in string theory to black holes big 
crunches or big bangs? A good question. Consider first the motion a particle on 
a circle of some radius and the motion of the string on a segment of some length. 
The circle is smooth, the segment has edges. For a point particle probe the first is 
smooth and the second is singular. In some cases, viewed by stringy probes, both 
are actually equivalent, i.e. both are smooth. The string resolves the singularity. 
Moreover there are black holes for which a string probe can't distinguish between 
the black hole's horizon and its singularity. There are many other cases in string 
theory where time-like singularities are resolved. The situation in the case of 
space-like singularities is more involved and is currently under study. Some 
singularities are in the eyes of the beholder. 

* Commutativity—The laws of quantum mechanics can be enforced by declaring 
that coordinates do not commute with their conjugate momenta. It is however 
taken for granted that the different spatial coordinates do commute with each 
other and thus in particular can be measured simultaneously. The validity of this 
assumption is actually also subject to experimental verification. A geometry can 
be defined even when coordinates do not all commute with each other. This is 
called non-commutative geometry. There are cases when strings moving on a 
commutative manifold would report the same observations as strings moving on 
a non-commutative manifold. It seems many certainties can become ambiguities, 
when observed by string probes. Stated differently, string theory has an incredible 
amount of symmetry. This may indicate that the space-time one is familiar with, 
is in some sense suited only for a “low” energy description. At this stage each 
of these pieces of magic seems to originate from a different source. The hard- 
wired mind seeks a unified picture for all this tapestry. This review may reflect 
ambiguous feelings, challenges, success and magic each have their share. All in 
all, string theory has been an amazingly exciting field, vibrating with new results 
and ideas, as bits and pieces of the fabric of space time are uncovered and our 
idea of what they really are shifts. 


6.4.7 Human Effort and Closing Remarks 


One may estimate that more than 10,000 human years have been dedicated up to 
now to the study of string theory. This is a global effort. It tells an amazing story 
of scientific research. In the mid eighties a regime/leadership change occurred. The 
scientific leaders which had nursed the standard model, have been replaced by a 
younger generation of string theorists. Some of those leaders complained that the 
field had been kidnapped from them. Mathematical methods of a new type were 
both applied and invented, many pieces of a vast puzzle were uncovered and a 
large number of consistency checks were used to put parts of the puzzle together 
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tentatively. As in any human endeavor alternative paths are suggested and critique is 
offered. I have incorporated in this note some of the serious challenges string theory 
faces. There are additional complaints, complaints that the new style leaves too 
many gaps in what is actually rigorously proven, complaints that a clean field theory 
description is needed and lacking. Research has been democratized and globalized 
to a quite unprecedented extent. This has mainly been achieved by making research 
results available simultaneously to most of the researchers by posting them on the 
web daily. There are some drawbacks inherent to that evolution, research attitudes 
have been largely standardized and the ranges of different points of view have been 
significantly focused/narrowed. The outside pressures to better quantify scientific 
production have found an easily accessible statistical database. In particular, that 
is leading to assigning a somewhat excessive weight for the citation counting in 
a variety of science policy decisions. The impact of that is yet to stabilize. All 
that said, one should notice that phenomenologists and experimentalists do turn to 
string theory and its spin offs in search of a stimulus for ideas on how to detect 
possible deviations from the standard model. The problems string theory faces 
are difficult, one approach is to accept this and plunge time and again into the 
dark regions of ignorance using string theory as an available guide, hoping that 
it will be more resourceful than its practitioners are. Another approach is to break 
away from the comfort of physics as we formulate it today and replace some of 
its working axioms by new ones, such as holography, the entropic principle or 
eventually perhaps something else. It seems that the question of the exact nature of 
the basic constituents of matter will remain with us for still quite awhile. My favorite 
picture is that it will turn out that asking if the basic constituents are point like, are 
one, two or three-dimensional branes is like asking whether matter is made of earth 
or air. A theory including the symmetries of gravity will have different phases, some 
best described by stringy excitations, some by point particles, some by the various 
branes, and perhaps the most symmetrical phase will be much simpler. Several of 
these will offer the conventional space time picture, other will offer a something 
new. String theory either as a source of inspiration, or as a very dynamic research 
effort attracting criticism leaves few people indifferent. 


8.5 Hindsight from 2018 


The years from 2007 to 2018 were yet another example of the well-known difficulty 
of predicting the future. Luckily what I wrote in 2007 contained, on this front, 
enough disclaimers to satisfy the most strict of lawyers. 

During those years one could celebrate a gigantic engineering triumph. After 
succumbing to a very significant early setback the LHC outperformed its expecta- 
tions. The experimental triumph was as impressive. The detectors performed well 
beyond what one could dream. They obtained results from picking out the Higgs 
particle out of haystack through rediscovering particles which were first detected at 
electron-positron machines to finding more hadrons in the heavier flavour sectors. 
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This success was not matched by validating any of the TPs (Theoretical 
Prejudices) which I had discussed in some detail. The discovery of the quite likely 
absence of new particles in the new energy vistas opened by the LHC had an impact. 
Of course one can still hope that positive new discoveries will be made and I for one 
do hope so; still it is more and more considered as a possibility that we will need to 
settle for a long time for accelerators that will validate the Standard Model and not 
uncover treasures beyond it. A school of thought is developing, since the activation 
of the LHC, that beyond the Standard Model lurks non other than the Standard 
Model itself. At least for a while. 

This requires soul searching of the theoretical physics community. What could 
be the result of such an examination of the routes taken so far? Recall that NOH 
(Nobody Owes Humanity), it may be that using elegance and beauty as our guiding 
principles, and indeed SUSY and string theory are beautiful, has failed us. Perhaps 
our esthetics taste does not confirm with that of nature. Perhaps these TPs will 
manifest themselves at some higher energy and perhaps we are on a totally wrong 
track. Falsifying ideas is a most important component of the scientific method, but 
many would not agree that the TPs have been indeed falsified. New ideas should be 
encouraged and welcomed, there is nothing as good as a good new idea to revitalize 
our thinking. The results of such debates cannot be dictated. Each researcher will 
draw her/his own conclusions. Be that as it may, such an open debate is required. 
Hopefully there will be something to report on in the next edition of this book. 

One aspect of “Beyond” physics that is considered more actively now involves 
testing the possibility that the Standard Model has scale or even conformal invariant 
features. Some of them I suggested long ago and pointed out in this volume, 
especially its relation to the puzzle of the value of the vacuum energy. 

The fact that, depending on the exact value of the top quark mass, the Universe is 
living dangerously and may eventually self-destruct as it decays to some other place 
was also noted and investigated. 

The dark matter elephant in the room remains very present. 

The mathematical aspects of SUSY have been considerably sharpened during 
these years and have led to many new surprising exact results in more regions of 
strong coupling where no one ventured before. I was most attracted to a branch 
of research trying to attempt to use quantum information aspects as ingredients 
that build up semiclassical geometry and also quantum space time. These attempts 
are in their infancy but they have already covered quite some ground. Sometimes 
it has happened that a change of perspective and bringing in new tools was a 
way for a younger generation to take over the leadership of a field. The absence 
of experimental data to crown new leaders serves as a fertile ground for this 
mechanism. 

Finally, back to experiment; for years black holes seemed as removed from 
experiment as the detection of stringy effects are 2017 has seen the hundred years 
search for the detection of gravitational waves reach a discovery moment. Humans 
observed particles collide at the LHC and black holes collide in the Universe—a 
great moment. Theorists are seriously considering the properties of those very black 
holes in terms of information processing. Seeds for a new TP have been sown. 
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Over this decade we have learned or should have learned the importance of 
patience, humility, and diversity of ideas. Not the type of lessons that theoretical 
physicists appreciate too much. 


8.6 References for 8 


We mention here only text books and reviews and only those which were used to 
prepare this entry. Most of the appropriate references for the works reviewed here 
can be found in them. 


(1) For Super Symmetry: 

Different narratives on the history of Super Symmetry by a large number of 
its pioneers has been collected in the book The Supersymmetric World, edited 
by G. Kane and M. Shifman. Published by World Scientific (2000). As most 
of them were still alive at the time this offers a unique perspective into how 
theoretical ideas develop. 


— Supersymmetry and Supergravity by J. Wess and J. Bagger. Princeton Series 
in Physics (1992). 

— The Quantum Theory of Fields, Volume 3: Supersymmetry by S. Weinberg. 
Cambridge University Press (2000). 

— Lectures on Supersymmetric Gauge Theories and Electric—Magnetic Dual- 
ity by K.A. In-triligator and N. Seiberg. Published also in Proceedings of 
Cargese Summer School lectures, Cargese. 

— Supersymmetric Gauge Theories by D. Berman and E. Rabinovici. Published 
in: Unityfrom Duality: Gravity, Gauge Theory and Strings, Les Houches 
Session LXXVI. Editors: C. Bachas, A. Bilal, M. Douglas, N. Nekrasov and 
F. David. Publisher: Springer. 

— Weak Scale Super Symmetry: From Superfields to Scattering Events by H. 
Baer and X.Tata. Cambridge University Press (2006). 

— Lectures on Supersymmetry Breaking by K. Intriligator and N. Seiberg. 
Published in Class.Quant. Grav. (2007). 


(2) For gauge theory: 

This subject was barely touched upon here. A group theory reference is: 
Group Theory for Unified Model Building by R. Slansky. Published in Phys. 
Rept. 79 1-128 (1981). 

(3) For extra dimensions: 


Kaluza-Klein Theory in Perspective. M.J. Duff (Newton Inst. Math. Sci., 
Cambridge). NI-94-015, CTP-TAMU-22-94, Oct. 1994. 38 pp. Talk given 
at The Oskar Klein Centenary Symposium, Stockholm, Sweden, 19-21 
Sept. 1994. In ,Stockholm 1994, The Oskar Klein Centenary, 22-35, hep- 
th/9410046. 
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(4) For strings: 
For text books and many references see for example: 


— MB. Green, J.H. Schwarz, E. Witten: Superstring Theory. Cambridge 
Monographs On Mathematical Physics (1987). 

— J. Polchinski: String Theory, Cambridge University Press (1998). 

— E. Rabinovici: String Theory: Challenges, Successes and Magic. Published 
in Europhys. News (2004). 
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Chapter 9 A 
Symmetry Violations and Quark Flavour — wi 
Physics 


Konrad Kleinknecht and Ulrich Uwer 


9.1 Introduction 


9.1.1 Matter-Antimatter Asymmetry in the Universe 


One of the surprising facts in our present understanding of the development of 
the Universe is the complete absence of "primordial" antimatter from the Big 
Bang about 13.7 billion years ago. The detection of charged cosmic-ray particles 
by magnetic spectrometers borne by balloons, satellites, and the space shuttle 
has shown no evidence for such primordial (high-energy) antibaryons; nor has 
the search for gamma rays from antimatter-matter annihilation yielded any such 
observation. In the early phases of the expanding Universe, a hot (10?? K) and 
dense plasma of quarks, antiquarks, leptons, antileptons and photons coexisted in 
equilibrium. This plasma expanded and cooled down, and matter and antimatter 
could recombine and annihilate into photons. If all interactions were symmetric with 
respect to matter and antimatter, and if baryon and lepton numbers were conserved, 
then all particles would finally convert to photons, and the expansion of the Universe 
would shift the wavelength of these photons to the far infrared region. 

This cosmic microwave background radiation was indeed observed by Penzias 
and Wilson in 1965 [1], and its wavelength distribution corresponds exactly to 
Planck black-body radiation at a temperature of 2.73 K (see Fig. 9.1). The density 
of this radiation is about 5 x 10? photons/cm?. 
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However, this radiation is not the only remnant of the Big Bang; there is also a 
small amount of baryonic matter left over, at a density of 6 x 107? nucleons/cm?, 
about 107? of the photon density. This phenomenon can only be explained if the 
three conditions of Sakharov [2] are fulfilled: 


— there must be an interaction violating CP invariance, where C is the particle— 
antiparticle transformation and P the space inversion operation; 

— there must be an interaction violating the conservation of baryon number; 

— there must be phases of the expansion without thermodynamic equilibrium. 


The first condition was shown to be fulfilled when, in 1964, J. Christenson, J. 
Cronin, V. Fitch and R. Turlay discovered CP violation [3] in decays of neutral 
K mesons. The second criterion would imply that protons are not stable; searches 
for such a decay have been unsuccessful, showing that the lifetime of the proton 
is longer then 10?! years. The third condition can be met in cosmological models 
by inflationary fast expansion or by a first-order phase transition in the electroweak 
interaction of the Standard Model. 

In the following, we shall concentrate on the observed CP violation, which could 
in principle lead to a small surplus of matter, the observed baryon asymmetry of 
1071 in the Universe. 


9.2 Discrete Symmetries 


Symmetries and conservation laws have long played an important role in physics. 
The simplest examples of macroscopic relevance are the laws of conservation of 
energy and momentum, which are due to the invariance of forces under translation 
in time and space, respectively. Both are continuous transformations. In the domain 
of quantum phenomena, there are also conservation laws corresponding to discrete 
transformations. One of these is reflection in space (the “parity operation") P 
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[4]. The second discrete transformation is particle-antiparticle conjugation C [5]. 
This transforms each particle into its antiparticle, whereby all additive quantum 
numbers change their sign. A third transformation of this kind is time reversal 7, 
which reverses momenta and angular momenta [6]. This corresponds formally to an 
inversion of the direction of time. According to the CPT theorem of Lüders and 
Pauli [7—9], there is a connection between these three transformations such that, 
under rather weak assumptions, in a local field theory all processes are invariant 
under the combined operation C - P - T. 


9.2.1 Discrete Symmetries in Classical Physics 
9.2.1.1 Parity P 


The parity operation consists in reversing the direction of the position vector F = 
(X, Y, Z) in Cartesian coordinates. This corresponds to reflection in a plane mirror, 
followed by a rotation by 180°. Symmetry under parity operation is therefore also 
called mirror symmetry. 

The parity operation reverses the direction of all polar vectors derived from the 
position vector; in particular, this is the case for the momentum p = mv = mdr /dt 
and the acceleration à = d?r /dt*. Therefore, the Newtonian force F- dp/dt is also 
reversed under the parity operation. 

This must be also the case for the Lorentz and Coulomb forces on a particle with 
charge q moving with velocity v: 


F = q(E +ï x B). (9.1) 


Since the charge q is invariant under P, and the force F and the velocity v change 
sign, the electric field strength E must change sign and the magnetic field strength 
B must remain unchanged. 

For the electric potential A, we obtain from the relations 


E = —grad V — 8 AJ8t, (9.2) 


m 


B = rot A, (9.3) 


the result that A changes sign and V remains invariant, since the spatial differential 
operator changes sign under the parity operation. 

We therefore have four classes of quantities with different transformation behav- 
ior under P: axial vectors or pseudovectors such as B and the angular momentum 
J =7x p, and scalars such as V, which remain invariant under P; and polar vectors 
suchas”, p, F 5 E and A, and pseudoscalars such as E. B, which change sign under 


P. 
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9.2.1.2 Time Reversal T 


This operation consists in reversing the sign of the time axis t. Under this operation 
t — —t,the velocity v, the momentum p and the angular momentum J change sign, 
while the force F remains unchanged under 7 . From the fact that the Coulomb and 
Lorentz forces are invariant, we derive the result that E — E and B — B under T; 
for the potentials, V — V but ASA 


9.2.1.3 Dipole Moments 


Elementary particles with spin may have electric or magnetic dipole moments. 
The spin 5 has the dimensions of an angular momentum, and therefore remains 
unchanged under parity and changes sign under time reversal. 

The potential energy of an electric or magnetic dipole in an external field is 
proportional to the scalar product of the electric or magnetic moment with the 
strength of the external electromagnetic field. Since the moments must be parallel 
to the spin, the potential energy is given by 


—deS-E for the electric case (9.4) 
and 
— dms: B forthe magnetic case (9.5) 


Here d, and dm are the electric and magnetic dipole moments, respectively. If we 
consider the transformation properties of 5, E and B under P and T, it turns out 
that for both operations dm — dm and de — —de. This means that observation of a 
nonvanishing electric dipole moment would violate any invariance under parity and 
time-reversal transformations. 

In classical physics, all processes are invariant under parity and under time 
reversal. In the case of mirror symmetry, this means that a physical experiment 
will lead to the same result as a mirror-imaged experiment, since the equations of 
classical physics are left-right symmetric. In a similar way, the classical motion of 
one particle can be reversed, e.g. by playing a film backwards, and this inversion 
of the motion corresponds formally to time reversal. Again, the laws of motion are 
invariant under 7, and the reversed motion follows the same path backwards as 
forwards. 

Of course, this is no longer the case if many particles move and interact with 
each other; in this case the second law of thermodynamics ensures that entropy is 
increasing, thus defining an arrow of time. 
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9.2.2 Discrete Symmetries in Quantum Systems 
9.2.2.1 Particle-Antiparticle Conjugation 


In relativistic quantum mechanics, the Dirac equation requires that for each solution 
describing a particle, there is a second solution with opposite charge, describing the 
antiparticle. The antiparticle of the electron, the positron, was found in 1933 [10], 
and the antiproton was found in 1955 [11]. The particle-antiparticle conjugation 
C transforms the field $ of the particle into a related field ø” which has opposite 
quantum numbers: the charge, lepton number, baryon number, strangeness, beauty, 
etc., for the antiparticle are opposite in sign to the values for the particle. 

Invariance under the C transformation is always valid in the strong and electro- 
magnetic interactions. This means, in particular, that the visible spectral lines from 
atoms and their antiatom partners are identical, and we cannot use these lines to 
identify antimatter in the Universe. 

This would be especially important in the science-fiction scenario in which a 
man-made spacecraft sets out to meet a distant civilization, where it would be 
advisable to know whether the other planet was made of matter or antimatter. In 
this case another means of differentiation would have to be found. 


9.2.2.2 Violation of Mirror Symmetry: Parity Violation in Weak 
Interactions 


Lee and Yang [13] suggested that of the four interactions—strong, electromagnetic, 
weak and gravitational—the weak interaction might violate mirror symmetry when 
it was described by a combination of vector and axial-vector currents in the 
Lagrangian (V-A theory). The interference of these two currents could lead to 
pseudoscalar observables which would change sign under the parity operation. One 
such observable is the scalar product of an axial vector (such as the spin of a particle) 
with a polar vector (such as the momentum of another particle in the final state). If 
the expectation value of this pseudoscalar is measured to be nonvanishing, then 
parity is violated. 

An experiment on the beta decay of cobalt-60 [14] measured exactly such an 
observable, the scalar product of the spin 5/1 of the ©°Co nucleus and the direction 
of the electron from its beta decay into an excited state of °°Ni with nuclear spin 
4h. The Co nuclei were polarized by embedding them in a cerium-magnesium 
crystal, where the magnetic moments were aligned by a weak external magnetic 
field of about 0.05 T. In the strong magnetic field inside this paramagnet, the ©°Co 
nuclei are polarized through hyperfine interactions if the temperature is low enough 
(0.01 K) to avoid thermal demagnetization. The polarization was measured through 
the asymmetry of y rays from the cascade decay of the Ni state. The measurement 
then required the detection of the electron direction relative to the polarization of the 
Co nuclei. The experimenters found that the electron was emitted preferentially in 
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a direction opposite to the external magnetic field, and that this effect disappeared 
when the crystal was warmed and the nuclear polarization disappeared. Thus, at low 
temperature, a nonzero pseudoscalar is observed, demonstrating parity violation. 

By comparing nuclear beta decays having an electron and an antineutrino in 
the final state with their counterpart with emission of a positron-neutrino pair, it 
was shown that the helicity h = $ - p/|s - p| of neutrinos is opposite to the one of 
antineutrinos [18]. 

Other experiments lead to similar results. The helicity h =s- p/|s- p| of 
the neutrino emitted in the weak electron capture by !??Eu was measured in 
an experiment by Goldhaber and Grodzins [19] to be negative; the neutrino is 
“left-handed”, i.e. the spin is aligned antiparallel to the momentum. Similarly, 
measurements of the polarization of electrons from £^ decay showed a negative 
value, with a modulus increasing with the velocity of the electron, v/c. Positrons 
from f* decay were found to have a positive polarization that increases with v/c. 


9.2.2.3 Violation of C Symmetry, and CP Invariance 


In the realm of weak decays of particles, supporting evidence for the violation of 
mirror symmetry came from the observation that parity is violated in the decay 
at pi is and that the muon neutrino from this decay is left-handed [15, 16]. 
The P-conjugate process, i.e. zz ^ — Vau, with a right-handed neutrino, does not 
occur. The same is true for the C-conjugate process, z — u` vy, with a left- 
handed antineutrino. However, if we combine the C and P operations, we arrive at a 
process zt — u Vy with a right-handed antineutrino, which proceeds at the same 
rate as the original z* decay, with a left-handed muon neutrino. Evidently, in weak 
interactions, P and C are violated, while it seemed at the time of those experiments 
that the process was invariant under the combined operation C - P. This argument 
can be visualized as in Fig. 9.2. Here the 7? mirror and the C mirror act on a left- 
handed neutrino, both leading to unphysical states, a right-handed neutrino and a 
left-handed antineutrino. Only the combined CP mirror leads to a physical particle, 
the right-handed antineutrino. This argument was made by Landau [17], suggesting 
that the real symmetry was CP invariance. 


9.2.2.4 CP Invariance and Neutral K Mesons 


One consequence of this postulated CP invariance was predicted by Gell-Mann 
and Pais [12] for the neutral K mesons: there should be a long-lived partner to 
the known V (K9) particle of short lifetime (10~!°s). According to this proposal, 
these two particles are mixtures of the two strangeness eigenstates K°(S = +1) and 
K9(S = —1) produced in strong interactions. Weak interactions do not conserve 
strangeness, and the physical particles should be eigenstates of CP if the weak 
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interactions are CP-invariant. These eigenstates are described as follows (where 
we choose the phases such that K? = CPK?): 


CPK; = CP E ELI = + (Ko +k°) SKi (9.6) 
CPK = CP E (K° -®)| = = (Ko - K?) m CR (9.7) 


Because CP |x tm) = |n * zt) for zx mesons in a state with angular momentum 
zero, i.e. the two-pion state has a positive CP eigenvalue, a decay into z^ is 
allowed for the K; but forbidden for the K»; hence the longer lifetime of K2, which 
was indeed confirmed when the K» was discovered [20, 21]. 


9.2.2.5 Discovery of CP Violation 


In 1964, however, Christenson et al. [22] discovered that the long-lived neutral K 
meson also decays to two charged pions with a branching ratio of 2 x 107°. 

The motivation of this experiment was twofold: the experimenters wanted to 
check on an effect found by Adair et al. [23] when the latter observed interactions 
of long-lived kaons (K2) in hydrogen, and they wanted to test CP invariance by 
searching for the decay of K» into two pions. Adair et al. had found anomalous 
regeneration of short-lived kaons (Ki) above expectation. “Regeneration” is an 
effect due to the different strong interactions of the two components of a long-lived 
kaon, K? and K?. This leads to a creation of a coherent K 1 component when a K2 
beam traverses matter. The anomalous effect above expectation was still observed 
in the experiment of Christenson et al. when the (K2) beam hit a hydrogen target. 

Therefore, Christenson et al. emptied the target and looked for Ka — ntr” 
decays from the vacuum. To their surprise, they found such decays, which meant 
that CP invariance was broken in this decay. 
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Fig. 9.3 The experimental setup used by Christenson et al. [22] for the discovery of CP violation 


The magnetic spectrometer used by Christenson et al. is shown in Fig. 9.3. 
On each side of the spectrometer, one charged particle is detected through spark 
chambers in front of and behind the magnet. The two vector momenta p; (i = 1, 2) 
of the two particles are measured. Assuming the mass of the particles to be the pion 
rest mass My their energies can be obtained from 


E? = p +m}. (9.8) 


The invariant mass of the pair is 


mag = [Gn E2 — (pi + Bo)?] , (9.9) 
and the kaon momentum is 
PK = pit po. (9.10) 


From the reconstructed kaon momentum, the intersection of the kaon flight path 
with the target plane gives an indication of whether this was a two-body decay 
coming from the target or a three-body decay with an escaping neutrino. 

In the latter case, the direction of px does not point back to the target source. 
The result of the experiment is shown in Fig. 9.4. A significant peak of K > mtr 
decays coming from the target direction (cos 6 = 1) is seen, while the background 
of three-body decays outside the peak can be extrapolated under the signal, and 
represents only ~20% of the data in the signal region: there is a signal at the level 
of 2 x 107? of all decays, and CP is violated. 

From then on the long-lived K meson state was called K; because it was no 
longer identical to the CP eigenstate K2. However, the physical long-lived state 
Ki, was a superposition of a predominant Kz amplitude and a small admixture 
of a Kı amplitude, Ki, = (K2 + £K1)/y 1 + |e|? where the admixture parameter 
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Fig. 9.4 (a) Experimental distribution of m* compared with Monte Carlo calculation. The calcu- 
lated distribution is normalized to the total number of observed events. (b) Angular distribution of 
those events in the range 490 < m* < 510 MeV. The calculated curve is normalized to the number 
of events in the total sample 


€ is determined by experiment to satisfy |e] ~ 2 x 10^? . Similarly, the short- 
lived state was called Ks, and Ks = (Kj + £K2)/ 1 + |e|?. The CP violation 
that manifested itself by the decay Kj, — z^; — was confirmed subsequently 
in the decay Kr, — z?z? [24, 25], and by a charge asymmetry in the decays 
KL > m*etv and Ky —> z-y*v [26, 27]. 


9.23 Discrete Symmetries in Quantum Mechanics 


The three discrete symmetries P, C and 7 are described by the operators P 
for the parity transformation, C for particle-antiparticle conjugation and 7 for 
time reversal. Invariance of an interaction described by a Hamiltonian H under 
a symmetry operation means that commutes with the relevant operator, e.g. 
[H, P] = 0. According to experimental evidence, the strong and electromagnetic 
interactions are P- and C-invariant. The corresponding operators are unitary, i.e. the 
Hermitian conjugate is equal to the inverse: 


gel. (9.11) 
Pİ = P8. (9.12) 


For two states |y) and |g), such a unitary transformation does not change the 
product: 


(1e) = We’), (9.13) 


where |y’) and |g’) are the transformed states. 
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By defining the intrinsic parity of the proton as +1, a phase convention for the 
fields can be chosen, such that the parity operator P has only eigenvalues of +1 
or —1, and all particles have intrinsic parities +1 as quantum numbers. These 
conserved quantities, which correspond to discrete symmetries, are multiplicative 
quantum numbers. 

The third discrete symmetry, time reversal, is a special case. The corresponding 
operator 7 is not unitary, but antiunitary. Here the bracket (y|g) is not conserved 
by the 7 transformation, but rather 


(w'le’) = (wie) . (9.14) 


Probability is still conserved, i.e. 


KY'A = Ile). (9.15) 


but the phases are not. The fact that 7 is antiunitary can be deduced from the 
Schródinger equation for a free particle, where the time derivative is odd under 
T while the Laplace operator A is even. This can be reconciled with 7 invariance 
only if 7 makes the changes i > —i and y > y*. 

CPT, the product of all three discrete transformations, being a product of two 
unitary and one antiunitary operator, is also antiunitary. According to the CPT 
theorem of Lüders [7] and Pauli [8], and Jost [9], a field theory with Lorentz 
invariance, locality, and the commutation relations given by the spin-statistics 
theorem, is CPT -invariant. At present there is no realistic field theory which 
violates CPT invariance. 

As a consequence of this theorem, violation of one of the three discrete 
symmetries implies a violation of a complementary one. If CP is violated, then 
T is also violated. 

The experimental consequences of CPT invariance are the equality of the 
masses, lifetimes and magnetic dipole moments of a particle and its antiparticle. 
These equalities have been tested with great precision, as shown in Table 9.1. 

A very special case in this context is that of the masses of the neutral K mesons. 
The mass difference between the long-lived Ky, and the short-lived Ks can be 
measured in interference experiments. This difference is due to second-order weak 


Table 9.1 Comparison of masses m, lifetimes t, and magnetic g-factors of particle and antiparti- 
cles 


Particle Im — mi/m Ir —T|/t le — zi/g 

e «4x 10-8 (—0.5 + 2.1) x 10-2 
m (1 8) x 1075 (—2.6 + 1.6) x 1078 
n 25) x 10-4 (5.5 € 7.1) x 10-4 l 

p «7 x 10-10 (0.3 £ 0.8) x 10-6 


K9 < 4.7 x 10-9 
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interactions and, therefore, is very small, about Am = (3.480 + 0.007) x 1076 
eV, which means that Am/mx < 107!4. From this, one can deduce very stringent 
limits on the mass difference between the K? and K°, of order 10-15, 

Thus, from experimental evidence, there is no doubt about the validity of CPT 
invariance. 


9.3 Mixing and Decay of Neutral Flavoured Mesons 


9.3.1 Particle-Antiparticle Mixing 


Neutral mesons (represented by N? in this chapter) with a characteristic quantum 
number, such as the strangeness S for K? mesons, charm C for DÓ mesons and 
beauty B for B? and BS mesons, have the particular property that they can mix with 
their antiparticles, which carry an opposite-sign quantum number. Weak interactions 
do not conserve any of these quantum numbers (S, C, B); consequently N° and 


N? can mix by second-order weak transitions through intermediate states such as 
2x, 37, Tt Uv, zt ev (for K°), or 2K (for B®). The states that obey an exponential 
decay law are linear superpositions of N° and NO, 


o IN?) + 8 |N?) = (2) (9.16) 


The time-dependent Schródinger equation then becomes a matrix equation 


tala) - *() 2n 
d) T iu 


where Xj, = Mix — il ;ik/2, and Mix and Ij, are Hermitian matrices, called mass 
matrix and decay matrix, respectively. Both of the latter two matrices are Hermitian: 
M = Mt and P = TÏ. However, X is not Hermitian. The elements of the matrix X 
are 


Xi = (N°HIN°), Xa = (NO[HINO) , 
X12 = (N°}HIN®) , X21 = (NOIHIN?) (9.18) 


where CPT invariance requires the diagonal elements to be equal: X11 = X22. The 
matrix has the form 


| ha acm (9.19) 
mp- EB m—5r 
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The off-diagonal elements of the matrices are given by 
Dy = 22 3 1 0, P Hw | F) CF Hw IN?) , (9.20) 


where the sum runs over all possible physical intermediate states F, which have a 
phase space density o, . Similarly, 


= NO 0 
Mp, = (N°|HwIN°) + M7 a en (9.21) 


M o — Mn 


where the sum extends over all possible virtual intermediate states n. 

The eigenvalue equations for X yield two eigenstates, which can be labeled by 
their mass: h for the higher mass, / for the lower mass. These eigenstates are the 
physical particles with a definite mass and an exponential lifetime distribution. The 
eigenvalues M, and M of the matrix X are 


i 
Mnr — my — I5, 
h h 3h 


My = mı — ;n (9.22) 


We denote the differences between the physical quantities by AI' = I, — T; and 
Am = my — m, > Q and denote the average values by 


|. Intl 


r , 9.23 
5 (9.23) 
m= mom , (9.24) 


For the BÜ—B0 system and the D°—p? system, the two decay widths J}, and Ij are 
expected to be nearly equal (because the numbers of final states for the decay are 
very similar). In these cases it is customary to introduce the dimensionless quantities 


_ Am 


r= 9.25 
and 
y= 3 9. 


Here x is positive by definition, and y varies between —1 and +1. For heavy systems 
such as B®, |y| is expected to be much less than 1, while for the KÌ system, y is 
found experimentally to be close to —1, since here the decay width of the lighter 
state is 600 times larger then that of the heavier state. Therefore, for the K? system, 
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Table 9.2 Parameters of the four neutral oscillating meson pairs [28] (see Sects. 9.5.5.7 and 9.6.4) 


K?/K9 D/D? B?/B0 BÜ/BO 
t [ps] 89.59 + 0.04 |0.4101-£0.0015 | 1.519+0.004 1.5102-0.007 
51160 +210 
r [10?5-!] 5.61 | 2.44x 103 (6582) (66243) 
y= AT/QT) | —0.9965 (0.6452-0.008)x 107? ||y| $0.01 | —(0.0622- 0.005) 
Am [109s-1] |(5.286-:0.011) | <70 (5072-2) (17.762:0.02)x 10? 
Am [10-9eV] |(3.4802:0.007) | <5 (33443) | (11.6940.13)x10? 
x- Am/T  |0.945:0.002 | <0.03 0.7692-0.004 | 26.81+0.10 


the lighter state is called Ks (“short-lived”) and the heavier state Ky, (“long-lived”). 
Table 9.2 gives a summary of various parameters of oscillating meson pairs. 


9.3.2 Decays of Neutral Mesons 
9.3.2.1 Time-Dependent Schrödinger Equation 


From the time-dependent Schródinger equation for mixed states given above, it 
follows that 


d 2 Zio * Q* a 
a (et +16?) = enm) (2). (9.27) 


Since both of the neutral mesons N° and N decay, the left-hand side of this equation 
is negative for any o or f. Therefore IT is positive definite, in particular T11, 75» and 
det I” are positive. 

The physical particles, which have a definite mass and lifetime, are mixtures of 
the eigenstates N° and N° of the strong interaction, which carry definite values of 
their characteristic quantum numbers strangeness S, charm C, and beauty 

If the weak interaction through which these particles decay is invariant under 
a discrete symmetry, say CP, then the physical particles are eigenstates of this 
symmetry because Hw commutes with CP. 

The effect of discrete symmetries on N° and N? is the following: CP is unitary, 
and there is an arbitrary phase a: 


d 


CP|N?) = e|N9) , 
CP|N®) = e~t N°) . (9.28) 
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CPT, however, is antiunitary, and, with an arbitrary phase b, 

CPT|N9) = ei^|NO) , 

CPT|N9) = e? |N?) . (9.29) 
For 7 , which is also antiunitary, we obtain 

TIN?) = el-4 (ND) 

T|N9) = ei&*oN9) (9.30) 


We choose the arbitrary phase a to be equal to 0 here, such that the eigenstates of 
CP are 


IN.) = (IN) i9) , 
IN-) = 7, (INh - i9). (9.31) 


with the property that they have CP eigenvalues +1 and —1: 
/2CPIN4) = CP|N)) +CPIN®) = |N°) + |N?) = V2|N,) 
J/2CP|N_) = CP|N°) — CP|N®) = |N?) — |N?) = —V2|N_) (9.32) 


Historically, in the K? system, |N.,) was designated by |K;), and |N_) by |K;). 
Discrete symmetries impose certain conditions on the elements of the mass and 
decay matrix. CPT invariance requires the masses and lifetimes of the particle 
and antiparticle to be equal, i.e. X1; = X22, or My, = Mo» and I; = I», for the 
diagonal elements. CP invariance requires that | Xj?| = | X21|. In the following, we 
assume CPT invariance. 
The eigenvalue equation for the matrix X yields 


Au = Am — ZAT e XI. (9.33) 


The corresponding eigenvectors of X are written 


IN4) = pIN9) — qINO) , 
IN;) = pIN9) + qIN®) , (9.34) 
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or in the form of the corresponding relations 
0 1 
N) = 57 (Na) + INi) 
P 
— -1 
IN9) = zg (Ni) — INI) . (9.35) 
q 


Unitarity requires |p|? + |q?| = 1, and CP invariance would mean p = q = 1/V2. 
In the case of CP noninvariance, an asymmetry parameter can be defined by 


= lcT6& 
Rugs go dics (9.36) 
pt+q q Les 
and we obtain 
la ee 
53m I» J-1iS8m M 
Qr. EUR LEM (9.37) 
Am — 5^T 
where 3m M15 >> 3m I1? for the K meson system, and therefore 
2 Am 
arg & ~ arctan (9.38) 
In this case the two physical states are not orthogonal, and we obtain 
Hes (9.39) 
ee TH eR | 


For the eigenstates of the time-dependent Schrédinger equation, the time evolution 
obeys an exponential decay law, as can be shown in the Wigner—Weisskopf 
approximation. Here, the time ¢ is measured in the rest frame given by the common 
mass defined by the strong and electromagnetic interactions. The time evolution is 
given by 


IN, (r)) = eii $T IN, O)) , 
IN; (2) = e$! IN, (9) . (9.40) 


In this way, N, decays as exp(— Iht) and N; as exp(— 171), while the phases of the 
two states evolve with different frequency, and this difference will show up in any 
interference effect between the two decaying mesons. 
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On the other hand, if initially a pure flavor state N? or N? is produced, the decay 
law is not exponential but shows oscillations. If we define the complex quantities 


. I 
yh = umg-d 7 
. "n 
y = amic PE (9.41) 


for the heavy (A) and light (7) meson states, respectively, then the amplitude for an 
initially pure state N° at time t = 0 is given by (9.34), and at a finite time t the two 
components evolve according to (9.40). At this time, the state is 


1 2 
N= 5 (v (e! + e?!) — INO (e! — em) ; (9.42) 
P 
The probability of finding an N? after a time f, starting from an initially pure N? 
state is 


2 
q 


p 


——— Í 
PIN Nes 


[e Tht 4 Tit _ 2e™7" cos (Am )]. (9.43) 
If we express this in the unified variables T = It, x and y, this reads 


P(N? > NO) = gt (cosh yT — cosxT) . (9.44) 


Similarly, the probability of finding an N in an initially pure N state is 
0 0 Ir 
PIN —^N)- 3° (cosh yT + cosxT) . (9.45) 
The difference between these two probabilities is then 


P(N? — N°) — PN? > NO) 


1 1 a 
RT (cosh yT + cosxT) — ae 2 
2 2 p 


1 2 
zB cosh yT |1— 4 +cosxT [1+ 4 
2 p p 


q 2 
cosxT 
p 


cosh yT — p 
2 
)| : (9.46) 
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However, the two states N;, and N; are not orthogonal, but their overlap ô is 


2 
E 
p 2Ree 
8 = (NalN) = Ip? — l4? = ——— = z. (9.47) 
1 H 1+ [el 
tiz 


From this result, we obtain 


E 
P(N? > N°) - P(N? > NO) = zT («E 
p 


2 
(ô cosh yT + cosxT) . 
(9.48) 
Similarly, 


M 
PN? > N°) + PN? > ND) = se" ( + 4 


2 
(cosh yT + dcosxT) , 
p 


(9.49) 


and the flavor asymmetry at time T in an initially pure flavor state becomes 


| PN? > N°) - P(N® > NO) cosxT + Scosh yT 


= —— = ———————— , 9.50 
P(N? — N°) + P(N0 — NO)  coshyT + ócosxT 


A(T) 


This function behaves very differently for the neutral K, D, and B meson systems. 


9.3.2.2 Decay Asymmetries and CP 
We define the decay amplitudes of neutral mesons to a final state f as 


Aş = (fITIN®) , 
Ay = (fITIN®) . (9.51) 


The decay amplitudes of the mass eigenstates are then 


Al = pAs —qAf, 
Al, = pAf t qÀf, (9.52) 


and we define the complex quantity 


sun 


l (9.53) 
pAf 


Àf 
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The moduli for the decay of N to f and the decay of N to its CP conjugate state 
f are equal if CP is conserved, and vice versa: 


IA sl = [AF], 
|AF| = [A r| j (9.54) 


(If f is a CP eigenstate, this is simplified to |A f| = [A p. 
Now CP violation may occur in three different ways: 


1. CP violation in the mixing, if |g/ p| Z 1, called “indirect CP violation". 

2. CP violation in the decay amplitudes, when (9.54) is not valid, called “direct CP 
violation". 

3. CP violation in the interference, when the phase of the expression 


Fea ae ot? 
AfA ATAT D /4 (9.55) 


is not zero. 
These three types of CP violation are characterized by the following details: 


1. CPviolation in the mixing. This type of CP violation will show up if the mass 
eigenstates of a neutral meson system are different from the CP eigenstates, i.e. 
if |q/ p| Æ 1 (or € z 0) and if there is a relative phase between M12 and T12. For 
the neutral kaon system, this is evident from the existence of the decay KL —^ 
mm, where |s| ^ 2 x 10 ^, and from the charge asymmetry in semileptonic 
decays ôL which is proportional to 2 Re e. For the neutral B system, this effect 
could be seen also in the charge asymmetry of semileptonic decays 


| TBH > l*vX) - P (B*(r) ^ I7$X) 


"SL F0) > Pv) + rB > -PX ` 


(9.56) 


This asymmetry is expected to be small in the Standard Model of order AI'/ Am 
or O(10~3). 

2. CPviolation in the decay amplitude. 'This effect appears if the decay amplitude 
A y of the neutral meson N to a final state f is different from the amplitude AF 


of the antiparticle N° to the charge-conjugate state f, i.e. |A>/ Ay| # 1. In the 
neutral-kaon decay to two x mesons, this is realized by the interference of two 
decay amplitudes, one with AJ = 1/2 to an isospin J = O state, and another 
with AJ = 3/2 to an isospin J = 2 state. The amplitude of direct CP violation 
is denoted by &' and proceeds through penguin diagram processes. The observed 
magnitude of this amplitude is |e’| ~ 4x 1075. In the neutral B meson system, the 
required two decay amplitudes with different weak phases and different strong 
phases could be a penguin diagram and a tree diagram, e.g. for the decay to 
the final state K z^ or K*z . The b — s penguin diagram has a dominant 
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contribution from a top quark loop, with a weak coupling V7, V;; and an isospin- 
1/2 (Kr) state. The tree diagram for b — u + (us) has a coupling V7, V, and 
leads to / — 1/2 or 3/2 states. The observed quantity is the decay asymmetry: 


| N(B > Krt) - N(B? > Ktn”) 


PEEL send uiu iab clc du M (9.57) 
N(B? — K-z*) + N(B? > K*z-) 


Asymmetries in the order of 10% and 025% have been observed for the B? and 
the B? meson respectively. 

3. CPviolation in the interference. Here the time dependence of the decay of an 
initially pure flavor state to a final state f is different for an initial particle or 
antiparticle. The final state can be a CP eigenstate such as nr (CP = +1) 
or J/VKs (CP = —1). In the kaon system the observed effect is m & ~ 1.6 x 
1073, while in the B system it is a very large asymmetry of order O(1). 


9.4 Models of CP Violation 


After the discovery of CP violation in K decay, a host of theoretical models 
was proposed to allocate this phenomenon to known interactions. Assuming CPT 
invariance of all interactions, the observed CP-violating effects in K decay imply 
also 7 violation (the experimental data of Sect. 9.5 are even sufficient to prove T 
violation without CPT invariance). In general, with CPT invariance, there are four 
combinations of violations possible: 


(a) T -conserving, C-violating and P-violating; 
(b) T -violating, C-conserving and P-violating; 
(c) T-violating, C-violating and P-conserving; 
(d) T -violating, C-violating and P-violating. 


Parity conservation in strong and electromagnetic interactions has been tested, 
for example by looking for a circular polarization in y rays from nuclear transitions. 
The presence of a wrong-parity admixture in one of the nuclear states involved will 
cause a small amplitude for a y transition with abnormal multipolarity that can 
interfere with the dominant amplitude and cause such a circular polarization. In the 
experiments of Lobashov et al. [46], polarizations of the order of 107? have been 
measured. These are consistent with being due to the two-nucleon force np — pn 
induced by the weak interaction (see [47] for a review). 

From many experiments of a similar nature, one can infer that the strong and 
electromagnetic interactions are not of type (a), (b) or (d). Therefore, if the source 
of the CP-violating phenomena is located in the strong or the electromagnetic 
interaction, there must be a part of one of those interactions that belongs to class 
(c), i.e. C- and 7 -violating, but P-conserving. 
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The proposed models can be grouped into the following four categories: 


1. Millistrong CP violation models [29-31] postulate the existence of C- and 7 - 
violating terms of order 10^? in the strong interaction. The process Ki, — 
xt is supposed to occur by the interference of two amplitudes: first, the K 
decays via the normal CP-conserving weak interaction, with AS = 1, into an 
intermediate state X, and then this state decays into m*z~ by a 7 -violating 
strong interaction. The amplitude of the process is of order G, a, where Gp 
is the Fermi coupling constant and a is the coupling of this CP-violating 
strong interaction. From the experimental value of |... |, one can conclude that 
a ~ 1979, 

2. Electromagnetic CP violation models [32-35] require large parts of the electro- 
magnetic interaction of hadrons to be C- and 7 -violating, but P-conserving. A 
two-step process Kr, — X — 2x could then occur through the interference of a 
weak and an electromagnetic CP-violating amplitude. The product of G, with 
the fine structure constant o is not too far from G, x 1073, as required by the 
magnitude of |n... |. 

3. Milliweak models assume that a part, of the order of 1073, of the weak interaction 
is CP-violating and is responsible for the observed effects. The decay Kr, — 27 
is then a one-step process, hence the name “direct CP violation", and CP or T 
violations of the order of 10^? should show up in other weak processes [36-44]. 
In these models, based on two doublets of quarks, CP violation is introduced in 
different ways. In one example [44], CP violation is due to the Higgs couplings, 
with flavor-changing neutral currents allowed; in another one [42], it is due 
to right-handed weak currents. A bold alternative was considered in 1973 by 
Kobayashi and Maskawa [48]: they saw that if there are three doublets of quarks, 
there is a possibility of CP violation in the 3 x 3 weak quark mixing. Today, with 
six quarks observed, this seems the most natural modelas discussed below. 

4. The superweak model [45] postulates a new AS = 2 CP-violating interaction 
that has a coupling (coupling constant g) smaller than the second-order weak 
interaction. This interaction could induce a transition Kg — Ks, with a subse- 
quent decay Ks — 27r. More precisely, this interaction would cause a first-order 
transition matrix element 


Mas = (K|HswlK) ~ 8Gp . (9.58) 


The mass difference itself is related to the second-order weak matrix element 


(KI? In) (n| Hy | K) 
Ms, = ee .. 9.59 
KE. 3 E, — E, i£ dade. 
where n is an intermediate state with energy E, and Hy is the weak 
Hamiltonian. In order that the CP-violating amplitude for Kr, — 27 relative 
to the CP-conserving amplitude should be of the observed magnitude, the 
ratio M w/Mgg must be of the order of 10 ?. Since M, © gG, and 
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My ^ Grim’, where the proton mass mp is used as a cutoff in the integration, 
this yields g ~ G m$, x 1078, 

This superweak interaction can be detected only in the K-Ks and BÓ-B? 
systems because these are the only known pairs of states with such a small 
difference in energy that they are sensitive to forces weaker than the second-order 
weak interaction. The clear prediction of this model is that there is no direct CP 
violation in the decay. 


For models other than the superweak one, violations of CP or 7 should manifest 
themselves in other reactions of particles or nuclei. One observable is the electric 
dipole moment (EDM) of the neutron. Most milliweak models predict this EDM 
to be of order 1077? ecm to 107?^ ecm, while the superweak model predicts 
107? ecm. The present experimental upper limit is 0.63 x 1077? ecm 

One of the milliweak models mentioned above under item 3 of the enumeration 
which is rather clear in its predictions should be noted: this is the idea of Kobayashi 
and Maskawa (KM) dating from 1973 [48]. At the time of the discovery of CP 
violation, only three quarks were known, and there was no possibility of explaining 
CP violation as a genuine phenomenon of weak interactions with left-handed 
charged currents and an absence of flavor-changing neutral currents. This situation 
remained unchanged with the introduction of a fourth quark because the 2 x2 unitary 
weak quark mixing matrix has only one free parameter, the Cabibbo angle, and no 
nontrivial complex phase. However, as remarked by Kobayashi and Maskawa, the 
picture changes if six quarks are present. In this case the 3x3 unitary mixing matrix 
Vig naturally contains a phase ô, in addition to three mixing angles (Sect. 9.7). It is 
then possible to construct CP-violating weak amplitudes from “box diagrams" of 
the form shown in Fig. 9.5. 

In the K°-K® system, this amplitude is proportional to the product of the four 
weak coupling constants G? Vis V Via ie If there is a nontrivial phase 6 in the 
unitary mixing matrix, then the product is a complex number, with the imaginary 
part depending on the phase 6. This leads to time-reversal (7) violation and to CP 
violation. The CP-violating mixing parameter for the kaon system, £, is given by 


_ G? fe mk ms 
G4/222^m 


By 3m(Via V*)F (m2, m2) : (9.60) 


Fig. 9.5 Box diagram for K0-K9 mixing connected with the CP-violating parameter € 
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Here G, is the Fermi constant, fk the kaon decay constant, Bx the kaon bag factor 
(0.80 + 0.15), and F(M?, m?) the loop function due to interference of the top and 
charm graphs, given by 


F(m?, m?) = Re (VÄ Vea) [mso (m2) — 7359 (n. m) — Re (VS Via) 250 (m?) ; 
(9.61) 


where So (m2) is a kinematical factor, and 5, n2, and 73 are QCD correction factors. 


For the B9-B9 mixing, a similar box graph applies, with the s quark replaced 
by a b quark. Here the amplitude is proportional to G? Vio Vi Via Vý. Analogous 
diagrams can be calculated for B? (b5) mixing and for D? (cd) mixing. 

All CP-violating amplitudes in the KM model are proportional to the following 
product of the three mixing angles and the phase ô (Sect. 9.7), 


J = |Vus Vub Vep sin ô| (9.62) 


A necessary consequence of this model of CP violation is the non-equality of the 
relative decay rates for Kj, > zz and Ky — zt 9x. This “direct CP violation" 
is due to “penguin diagrams" of the form given in Fig. 9.6 for kaon decays. The 
amplitude for this direct CP violation is denoted by &'. In kaon decays, it will show 
up in the interference of two decay amplitudes, with the final two-pion state having 
isospin 0 or 2 (Ao and A» in (9.91)). With six quarks, the weak quark mixing through 
flavor change can carry a nontrivial phase ô in the mixing matrix, and therefore 
can induce a CP-violating difference between weak decay amplitudes, such that 
EVE [A y]. This model gives an explicit origin of direct CP violation with a 
predictable size. In the kaon system, these asymmetries are very small because of 
the small value of |J| ^ 3 x 1075, the suppression of AJ = 3/2 currents, and the 
partial cancellation of two penguin graphs, called Q6 and Qs, shown in Fig. 9.6. 
However, in the B® system, the asymmetries of the decay rates to CP eigenstates 
can be very large. Examples are the decays B? > J/WKg and B? > ntr”. 

The main models which could be tested experimentally were the KM model and 
the superweak model, and the decisive question was the existence or non-existence 
of direct CP violation. For the kaon system in the superweak model &' = 0, and 


Fig. 9.6 Penguin diagrams for K? — 27 decay with direct CP violation (amplitude s^). The 
graphs correspond to the Wilson operators Qg and Qg and give rise to amplitudes with opposite 
signs 
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the decay rates of Kr, to mtz" and to zx? are equal. The predicted value of s’ 


within the KM model can be estimated if one infers the magnitude of the mixing 
angles from other experiments and if the hadronic matrix elements for box graphs 
and penguin graphs are calculated. Typical values of |e’/e| are in the range -(0.05- 
2.0) x 10 for three generations of quarks. A measurement of this quantity to this 
level of precision therefore becomes the experimentum crucis for our understanding 
of CP violation. If e’ is orthogonal to £, then a measurement of the phases of n4— 
and of noo (see Eqs. (9.79) and (9.80), respectively) can help to detect a finite value 
of Sim (e' /&). If, however, the phase of e' is close to that of £, and since |e’/e| « 1 
to a good approximation, we obtain 


1 j 2 
-— (=) n ( D ) . (9.63) 
e e) 6\ m- 
Various methods have been used to calculate the value of te(e’/e). Owing to the 
difficulties in calculating hadronic matrix elements in the penguin diagrams, which 
involve long-distance effects, the task turns out to be very difficult. In particular, the 
electroweak penguin diagram (corresponding to the Wilson operator Qg) and the 
QCD penguin diagram (operatir Qg) yield contributions of opposite sign and lead 
to a partial cancellation in the result &'/e. At the time of this review, great progress 
has been made by lattice calculations which now can be compared to the two groups 
of earlier analytic calculations. 
The lattice calculation of the RBC-UKQCD-collaboration [49, 50] imply the 
following values of the penguin graph matrix elements [51, 52]: 


Bg = 0.57 +0.19 and Bg = 0.76 + 0.05. (9.64) 
From this the lattice method obtains: 


e'/e = (1.9 + 4.5) x 1074 (9.65) 


On the side of analytic calculations, one method was based on the limit of large N, 
where N is the numbers of colours in QCD [53] (“dual QCD” or “DQCD”). At large 
N, QCD becomes a theory of three mesons. Here one gets Bg = Bg = | at the pion 
mass scale. Considering the meson evolution of these matrix elements to the mass 
scale of 1 GeV [54], Buras and Gerard obtain a suppression of both Bg and Bg. At 
the scale of the charm quark mass, they find 


Bg < 0.6 and Bg — 0.80.1, (9.66) 


in agreement with the lattice results. using the lattice values for Bg and the relation 
Bo < Bs, they find an upper bound; 


e'/e € (6.0 2.4) x 107^ (9.67) 
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However, an alternative analytic calculation [55] based on chiral perturbation theory 
which emphasizes final state interactions and neglects the meson evolution, obtains 
a much larger value 


e'/e = (15 +7) x 1074. (9.68) 


Further improvement of the lattice calculations will hopefully clear-up this impor- 
tant discrepancy of the standard model prediction for e'/e. 


9.5 The Neutral K Meson System 


9.5.1 Mass Eigenstates and CP Eigenstates 


The eigenstates of strangeness are K? (S = +1) and KO (S = —1), the CP eigen- 
states are K; (with CP eigenvalue +1) and K» (CP eigenvalue —1), and the mass 
eigenstates are 


IKs) = p |K?) + g|K®)  (Short-lived) , (9.69) 
IKL) = p IK?) — q|K®) (Long-lived) , (9.70) 


where, from experiment, Ky is the heavier state (A). The lifetimes of the two 
eigenstates are very different. While the short-lived particle (Ks) has a mean 
lifetime of (0.8959 + 0.0004) x 107!° s, the long-lived particle Ky, has a lifetime 
of (5.17 + 0.04) x 107? s, i.e. 600 times larger. This is due to the fact that the 
dominant CP-conserving decays are Ks — 2m and Ky —> 3x, mev, muv, with 
a much smaller phase space for the three-body decays. Using the parameter 
€ = (p — q)/Cp + q), (9.69) and (9.70) can also be written 


|K (Ki) + €|K2)) , (9.71) 


) 1 
s) = -= 
y 1+ lel? 

1 
K1) = —— 
v1 + lel? 


The long-lived state is therefore mainly a state with CP eigenvalue —1, with a 
small (2 x 107?) admixture of a CP +1 state Kj. The two mass eigenstates are 
not orthogonal if CP is violated, because (Ks|Kr ) = 2 Nee. 

If the validity of CPT symmetry is not assumed, the expressions are generalized 
to 


(JK2) + e€|K1)) . (9.72) 


1 
[Ks) = ———_(IK}) + (e + 4)|Ka)) , (9.73) 
1+ |e + ô|? 
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1 
IKL) = —————2 (Ko) + (e — 6)|K1)) . (9.74) 
1+ le — ô|? 


with a CPT violating parameter ô. 


9.5.2 Isospin Decomposition 


In Ks, — 2x decays, the angular momentum of the pions vanishes. The spatial 
part of the wave function is therefore symmetric, and since pions are bosons, the 
isospin wave function must be symmetric too. The two symmetric combinations of 
two I = 1 states have J = 0 and J = 2, and the four transition amplitudes that exist 
are 


(JT|Ks), | QITIKs).  (I|TIK)). — (2|T Kr) . (9.75) 


These can be reduced to three complex numbers by normalizing to the amplitude 
(0|T|Ks): 


O|T|K 
ats (ITIK) ; (9.76) 
(OIT IKs) 
| (2IT|K1) 
£9 = ——, (9.77) 
tU V2 (OITIKs) 
2|T|K 
o = QITIKs) . (9.78) 
(0|T |Ks) 
The experimentally observable quantities are 
G^ |TIKi) 
b> RSS (9.79) 
(atx |T|Ks) 
(rn^ ITIK) 
= =>, 9.80 
700 7 (nT Ks) oo 
I(Kj,— nx l*v) - (Kj => rtr Y) 
(ea UN (9.81) 
(Ky > x-[*v) + (Kr > z*17v) 
Relating the isospin states to the physical 27r states 
(0| — lani = - Ec erg + LET (9.82) 
3 4/3 V3 
1 2 1 
2| = (at | ae won| + — (n |. (9.83) 
EET B J/6 
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we obtain 
E0 + €2 
ny- = ——————, (9.84) 
1+ (1/4/2)0 
£0 — 262 
= ———. 9.85 
700 loia (9.85) 


Because of the validity of the ^7 = 1/2 rule for CP-conserving weak nonleptonic 
decays, œw < 1 and therefore can be neglected. 

A suitable choice for the phase of the K? — 2; (I = 0) amplitude is obtained 
by choosing this amplitude to be real except for final-state interactions between two 
pions, leading to a phase shift ôo: 


(O|7|K°) = e!°Ay and Ag real. (9.86) 
Similarly, 
Q|T|K9) = e? A, . (9.87) 
With these choices, we obtain 


£0—€8, (9.88) 


i ; Sm A 
gj = iOD e, (9.89) 
J/2 Ao 
Therefore, representing € and &' in the complex plane, we obtain the triangle 
relations 


na -— 8g, noo = £ —2s' . (9.90) 
In this way, 7+—, noo and 3e' form a triangle in the complex plane, the Wu-Yang 
triangle. The CP-violating decay amplitude &' is due to interference of AJ = 


1/2 (Ao) and AI = 3/2 (A2) amplitudes: 


, ism Ag ei(d2—d0) 


= 91 
m (9.91) 


Its phase is given by the zz phase shifts in the J = 0 and J = 2 states, do and 65, 
respectively, assuming CPT invariance: 


arg(e’) = (82 — ôo) + = l (9.92) 


The zz phase shifts have been measured precisely in pion-scattering experiments. 
The results obtained are 62 = (—7.2+ 1.3)? [151] and 69 = (39 + 5)? [152]. Using 
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dispersion relation calculations or chiral perturbation theory, these results can be 
used to extract arg(e’). The results are (42.3 + 1.5)? [149] and (46.0 + 3.6)? [150]. 
The decomposition of the observable decay amplitude into £ and &' corresponds 
to a separation of the CP-violating effects due to the mass and decay matrices 
(represented by £), which are seen also in the impurity of the Ky and Ks states, 
from CP violation in the transition matrix element (represented by £”). 
The phase of € is given by (9.37) and (9.38): 


2Am 
arge = dp + arctan ( f ) ; (9.93) 
S 


where Am = m, — ms and 


dy = — arctan ( (9.94) 


sm I12 
23m Mi» - 
If there is no strong CP violation in the channels K > 27 (J = 2), K > xlv, 
and K — 3x, ®p is very small. This can be deduced from the Bell-Steinberger 
unitarity relation [156]. If the final states of the Kj, and Ks decays are designated by 
|F}, then 


Ts = 3 WFITIKS)? , (9.95) 
F 

T = » (FITIK? . (9.96) 
F 


Unitarity leads to the relation 


i (Ms — Mr) (KsIKi) = È (FITIKL)* (FIT IKs) . (9.97) 
F 


If CPT invariance is not assumed, the left side of this unitarity relation includes a 
contribution of the CPT violating parameter ô. It then has the form [156]: 


(1 titan Psw)[Re e — im 8] (9.98) 


The mass matrix elements are then (with CPT invariance assumed) 


M: M 
X11 = X2 = M (9.99) 
_ (Ms — Mi) + €) 
Xn = ECC DOT NN (9.100) 
X2 = (Ms — MU =€) (9.101) 


2(1 +€) 
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Ignoring all final states but 27 or assuming 3m I1? = 0, we obtain the following 
from the unitarity relation or from (9.37): 


2^m 
arg £ = arctan ( ) = Øw (9.102) 
Is 


where ® w designates the phase in the superweak model 
When we add other final states of CP-violating decays, the phase is shifted by 
Gp, and an upper limit can be obtained from the unitarity relation: 


0.75 
dv I'p,cpv + l'F,cPC (9.103) 


D <= 
Ts|n+—| 


where the sum runs over all states F Z 27 and the root is taken of the product of 
the CP-violating (CPV) and CP-conserving (CPC) decay rates. 

Present limits on CP-violating processes in these decays show that contributions 
from semileptonic decays are negligible. Using the limits on (AQ = AS)-violating 
amplitudes, we obtain 


\Dp(Ke3)| < 0.07? , |Sp(K,3)| < 0.05° . (9.104) 


In the same way, the measurement of the CP-violating part of the Ks — zr ^ zt? 


decay [153-155], 
rK +_—_0 
D(Ksmmnm) = 9 x107, (9.105) 
Is 
allows us to set the limit 
|Dp(atx~7°)| « 0.05? . (9.106) 
Similarly, from the limit [158, 159] r (Ks ^ 3n9)/Ts < 2.8 x 107? it follows that 
Ip (3x9)| «0.02. (9.107) 
New, more sensitive experiments will improve this limit. If we use the experimental 
values of Am and Ts from Sect. 9.5.5, then arg ¢ = (43.4 + 0.1 + 0.17)?, where the 
first error comes from the uncertainties of Am and Ts and the second error from the 


uncertainty of dp. 
Another independent observable is the charge asymmetry 


ndis 


ôL = i re (9.108) 
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where x — g/f is the ratio of the AQ — —AS to the AQ — AS amplitude 
(Sects. 9.5.3.2 and 9.5.5.6). 


9.5.3 Interference Between Decay Amplitudes of Ky, and Ks 


An arbitrary coherent mixture of Kj, and Ks states will show interference phenom- 
ena when decaying into 27 and in other common decay channels. According to 
Sect. 9.3.2.1 the eigentime development of K; is 
IK) > |Kr) eMe , (9.109) 
where My, = my — (1/2)11, and correspondingly for Ks. An arbitrary mixture 
[V(0)) = as|Ks) + aLIKr) (9.110) 
will develop into 


|y (t)) = ase 1 5* [Ks) + ape" [Ki) . (9.111) 


We call the ratio as/a = V. 


9,5.3.1 2x Decay 
The 2z decay amplitude is therefore 
Qz|T|y(x)) = as e! 5* (27x |TIKs) + ace" (2x|T|K1) 
= (2x|T |Ks)as e 1 5* + apne iMt, (9.112) 


where n = n+- for zt * z- decay and n = noo for 2°7° decay. The observed decay 
rate is proportional to 


R(t) = Jag)? e5* + Jar, n|? 7*5 + 2Jasllarlin| e MEY cos (Amt +D) . 
(9.113) 


where ® = arg(as) — arg(naL). We obtain for various initial conditions of the 
mixture: 


1. For an initially pure K? state (as = 1 = ar), 


R(t) = e5* + [n e7Ti* + 2]g| e UU*T90/2 cos (Am v — arg) . 
(9.114) 
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2. Foran initially pure Ko state, the interference term changes sign. 
3. For an incoherent mixture of K? (intensity N,) and K? (intensity Ne). the 
interference term is multiplied by the “dilution factor" 


N, — N- 
MULUS (9.115) 
Ny, + Ne 
Measurement of the interference term under these conditions is called the vacuum 
interference method. 
4. For the coherent mixture behind a regenerator, as = ọ, aj, = 1, and we obtain 


R(t) = lol? e 15* + |n? e T7 + 2elln e C179 0/2 cos (Amt + Gg — arg n) . 
(9.116) 


9.5.3.2 Semileptonic Decays 


Interference phenomena and C7 violation can also be observed in the decay of a 
coherent mixture of K? and K? mesons into semileptonic final states. In particular 
the time-dependent charge asymmetry 5(t) = (Nt — N—)/(N* + N7) shows an 
oscillatory behavior, where N+ denotes decays into zt e^ v final states and NT into 
zx ety. Assuming CPT invariance, we obtain 


1- |x? 
IL — x|? 


8() =2 [hee + [V] e 0/208-TU* cos (Am t + 9v) eo (17) 


where x is the ratio of amplitudes with AS = —A Q and AS = AQ. x is consistent 
with zero, in agreement with the AS = AQ rule. 

For an initially pure Kj, beam (R = 0), the asymmetry is independent of the 
decay time: 


|l x^ 


Ix? 
ôL —23tee 


(9.118) 


For an initial incoherent mixture of K? (Nx) and K? (NZ) the quantity |R| has to 
be replaced by (Ny — N.-)/(Ny + NZ), i.e. by the same dilution factor as for 27r 
interference in a short-lived beam. 

For the coherent mixture created by a regenerator, R is given by the regeneration 
amplitude o, and ®p by the regeneration phase $p. 
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9.5.4 Detection of K? Decays 


The main decay modes originating from KÜ's in a neutral beam and their respective 
branching ratios are [163] 


Kt z*etv (40.55 +0.11)% Kg; 
KL z*ut*v (27.07 +0.07)% K3; 
Kj > z*z-29 (12.54 +0.05)% K3; 
Ky > w9n%r (19.52 +0.12)% K3; 
Ks zx^z- (69.20 £0.05)% Kg; 
959 (30.69 +0.05)% Ky2. 


Ks > 2 


The experimental problem is to detect the rare CP-violating decay modes 
Ki — z*z- and Kj > zx 9x, with branching ratios of order 10-3, in this 
overwhelming background of other decays, and to measure their decay rate, and, by 
interference, their phase relation to CP-conserving decay amplitudes. In addition, 
the CP impurity in the Kj, state can be obtained by measuring the charge asymmetry 
in the semileptonic decay modes. 


9.5.4.1 Charged Decay Modes 


The two charged decay products in z:*;z ^ and semileptonic decays are usually 
recorded in a magnetic spectrometer consisting of a wide-aperture magnet and at 
least three layers of position-measuring detectors. The vector momenta p; (i — 1,2) 
of the charged decay products are measured and the energies of the particles are 
obtained from the calculated vector momenta p;, assuming their rest mass to be 


My, as 
Ei =; P? +m}. (9.119) 


The invariant mass of the pair is 


Max = y (E1 + E2}? — (pi p? , (9.120) 


and the kaon momentum p, = p + po. The lifetime of the kaon from the target to 
the decay vertex (zv) in the kaon rest system is given by t = (zy — zr)m,/(cpz), 
where m, is the kaon mass, c the light velocity, and p; the component of p, along 
the beam line. 
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Two sets of information can be used to separate 27 and leptonic decays. First, the 
invariant mass Myy is required to be equal to m, within the experimental resolution. 
Second, all experimenters use lepton identification. 

The most frequently used methods for electron identification at intermediate 
energies, around 10 GeV, are Cerenkov counters, and identification through com- 
parison of the energy deposition in electromagnetic and hadronic showers. At high 
energies, i.e. for electrons with energies between 10 GeV and 100 GeV, electron 
identification in calorimetric detectors works on the principle that for a particle of 
momentum p, the energy E deposited in a calorimeter by an electron (or photon) is 
much higher than for a hadron of the same momentum. 

For the identification of muons one uses their penetration through several (8) 
interaction lengths of material in order to distinguish them from pions interacting in 
this absorber. 

Once the 27 decay mode has been identified, one has to know, in general, 
whether the Ks or Kr, from which the decay products originate has undergone 
scattering on its way from its production to the decay point. In the case of a short- 
lived beam produced by protons interacting in a target near to the detector, this is 
done by calculating the distance of the intercept of the reprojected kaon momentum 
py in the target plane from the target center, py. Unscattered events cluster around 
pi = 0. In the case of a long-lived beam, one uses the component of p, transverse to 
the beam, p:, or the angle 6 between the kaon direction p, and the beam direction 
in order to separate transmitted and coherently regenerated (0. = 0 = p) kaons 
from events due to kaons that have undergone scattering, or diffractive, or inelastic, 
regeneration. 


9.5.4.2 Neutral Decay Modes 

The detection of the neutral decay mode Kj, — z?z? — 4y is complicated by the 
presence of the decay Ki, — 32° — 6y with a 2196 branching ratio. This decay 
can simulate 4y events for kinematic reasons, e.g. if two y rays are missed by the 
detector. Very specific kinematic features of the 27° decay were therefore used in 
the early medium-energy experiments in order to obtain a clean Kr, — 27° signal 
[167—170, 172]. 

For kaon energies between 40 GeV and 200 GeV, totally absorbing electro- 
magnetic calorimeters are used. These calorimeters consist of scintillating crystals, 
Cerenkov lead glass counters, or liquid-noble-gas detectors with or without lead 
radiators. Their longitudinal thickness is around 25 radiation lengths, and their 
transverse segmentation corresponds to the transverse width of an electromagnetic 
shower, given by the Moliére radius Ry of the material. In this way, the energies E; 
and the transverse positions (x;, yj) of each of the four photon-induced showers are 
measured in the calorimeter. This is the only information available for reconstruct- 
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ing all variables describing the decay. In principle, the invariant mass of the four 
photons can be calculated using the relation 

2 


4 4 2 
M? =F -p = [x ni) = (x ai) (9.121) 
i=l i=l 


= ) XEiE; — E;E; cos6ij) (9.122) 
izj 
07 
= 2) BEI - (9.123) 
i<J] 


The opening angle 6;; between two photons can be obtained from the transverse 
distance r;; between the impact points in the calorimeter, 


rij 2 Oi = xj + Oi — yj», (9.124) 


and the distance z of the K meson decay point from the calorimeter. Using these 
variables, the invariant four-photon mass can be written as 


1 
M=- I$ EEs,. (9.125) 
i<j 


This relation can be used to calculate the distance of the decay point of the kaon 
from the calorimeter by using the kaon mass as a constraint: 


1 
z= M. > E;E jr}. (9.126) 


i<j 


With this knowledge about the decay point, the invariant mass of any pair (i, j) of 
photons can then be calculated: 


1 
Mij = zuv EE . (9.127) 


Of the three possible combinations, the one where both masses are closest to the 
z? mass is chosen. A scatter plot of m» versus m34 shows a signal at (mo, mo) 
if the four photons come from the decay Kj, — 2°7°, while for events from the 
decay Kj, — 37°, with four detected photons, the invariant masses are spread over 
a large region around this point (Fig. 9.20). It is possible to extract the amount of 
background in the signal region by extrapolating the observed level of background 


events into this signal region, with the help of Monte Carlo simulations of the Kj, — 
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37? background. In high-energy experiments, the background can then be reduced 
to a level below one percent. 


9.5.4.3 Detectors Measuring Charged and Neutral Decay Modes 
Simultaneously 


For the measurement of the parameter e' of direct CP violation, the ratio of the 
decay rates of Kr, into charged (zt ^ zt.) and neutral (959) two-pion states has to be 
measured with great precision. For this purpose, experimentalists reduce systematic 
normalization uncertainties by measuring charged and neutral decays (from Ky, and 
Ks mesons) simultaneously. 

Four such experiments have been constructed for high-energy K meson beams, 
two of them at CERN (NA31 and NA48) and two at Fermilab (E731 and kTeV). 

One experiment has been designed to detect K mesons of a few hundred MeV/c 
momentum arising from the annihilation of stopping antiprotons in a hydrogen 
target at the Low Energy Antiproton Ring (LEAR) at CERN (CPLEAR). 


9.5.4.4 NA31 


This detector (Fig. 9.7) [171], situated in a Kj, or Ks beam from the CERN SPS, was 
based on calorimetry and was designed for good stability and high efficiency. The 
Kı and the Ks beam were produced by a 450 GeV proton beam with a production 
angle of 3.6 mrad. 
A schematic illustration of the beam layout and the apparatus is shown in Fig. 9.7. 
The principal features can be summarized as follows: 


* to adjust the steeply falling vertex distribution of the Ks decays to the almost flat 
vertex distribution of the Ky, decays, the Ks target is located on a train that can 
be positioned at 41 stations in the decay volume; 

* ananticounter with a 7 mm lead converter in the Ks beam is used to veto decays 
in the collimator, defines the upstream edge of the decay region, and provides for 
the relative calibration of the 27° and z^ energy scales to a precision better 
than +1073; 

* two wire chambers spaced 25 m apart, with +0.5 mm resolution in each 
projection, track charged pions; 

* aliquid-argon/lead sandwich calorimeter with strip readout detects photons with 

+0.5 mm position resolution and an energy resolution of 
cg/E = 10%/E ®7.5%/VE © 0.6% (E in GeV); 

* an iron/scintillator sandwich calorimeter measures the energy of charged pions 
with +65%//E (E in GeV) energy resolution. 


The energies of the two pions and their opening angle are used to measure the 
invariant mass of the charged pair. 
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Fig. 9.7 Beam layout and detector in the NA31 experiment. The setup contains a movable target 
for the production of the Ks mesons, and a liquid-argon calorimeter; it does not contain a magnet 


The NA31 experiment collected Kj, and Ks decays in alternating time periods. 
In each case the charged and neutral decays were collected simultaneously. 


9.5.4.5 NA48 


This experiment (Fig. 9.8) was also built at CERN [201]. The detector was exposed 
to a simultaneous nearly collinear KL /Ks beam, derived from a 450 GeV/c proton 
beam from the SPS. 

The Ks beam was produced by using a fraction of the protons that did not interact 
with the Ky, target. 

Charged particles were measured by a magnetic spectrometer composed of four 
drift chambers with a dipole magnet between the second and the third chamber. 
The average efficiency per plane was 99.596, with a radial uniformity better than 
+0.2%. The space point resolution was 4:95 um. The momentum resolution was 
op/p = 0.48% €«p 0.009% x p, where the momentum p is in GeV/c. The mz 
invariant mass resolution is 2.5 MeV. 

A liquid-krypton (LKr) calorimeter was used to reconstruct K — 2zr° decays. 
Cu-Be-Co ribbon electrodes of size 40 um x 18 mm x 125 cm defined 13212 cells 
(each with a 2 cm x 2 cm cross section) in a structure of longitudinal projective 
towers pointing to the center of the decay region. The calorimeter was ~27 radiation 
lengths long and fully contained electromagnetic showers with energies up to 
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Fig. 9.8 Beam layout and detector in the NA48 experiment 


100 GeV. The energy resolution of the calorimeter was og /E = (3.23: 0.2)96/ E o 
(9 + 1)96/4/ E & (0.42 + 0.05)96 where E is in GeV. 


9.5.4.6 kTeV 


The kTeV experiment at the 800 GeV/c Tevatron [180] uses a regeneration tech- 
nique to produce the Ks beam (Fig. 9.9). The Kj, “double beam” entered from the 
left, one half continuing as Ky, the other half producing a Ks beam by regeneration. 
The regenerator in the KTeV experiment was made of blocks of plastic scintillator. 
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Fig. 9.9 kTeV beam and detector 


These blocks were viewed by photomultiplier tubes to reject inelastically or quasi- 
elastically scattered kaons. 

The evacuated decay volume extended over 40 m, or about 7 Ks mean lifetimes 
at 100 GeV/c, followed by the first drift chamber. The analysis magnet was located 
between the second and the third drift chamber. Each of the four drift chambers had 
two horizontal and two vertical planes of sense wires. The typical single-hit position 
resolution of the drift chambers was about 110 um in either direction, which lead 
to a mean mass resolution of 2.2 MeV for the reconstructed kaon mass obtained 
from the zx *z decay mode. The momentum resolution for a charged particle was 
op/p = 0.1796 & 0.007196 x p (in GeV/c). 

The four photons from the 27° were detected in an electromagnetic calorimeter 
made of pure cesium iodide. The calorimeter consisted of 3100 blocks arranged in 
a square array of 1.9 m side length. The blocks had two sizes: 2.5 x 2.5 cm? in 
the central region and 5 x 5 cm? in the outer region. All blocks are 50 cm, or ~27 
radiation lengths, long. Two 15 cm square holes allowed the passage of the Ky, and 
the Ks beam through the calorimeter. The calorimeter had an energy resolution of 
cog/ E — 296/4/ E 0.496 (E in GeV). The average position resolution for electrons 
was about 1.2 mm for clusters in the smaller crystals and 2.4 mm for the larger 
crystals. 


9.5.4.7 CPLEAR 


In contrast to the detectors described in the previous sections, the CPLEAR detector 
measured decays from kaons produced in pp annihilations at rest obtained from 
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the low energy p ring LEAR at CERN. The antiprotons were stopped in a 16 bar 
hydrogen gas target and formed a protonium before annihilation. In annihilation 
reactions of the type pp — K?K- z^ and pp — K°Kts~, the charged kaon was 
identified through the time of flight and track curvature in a solenoidal magnetic 
field (“tagging”). This tag was used as a trigger for detecting the decay products 
of the neutral K meson associated with the K* or K^. A unique property of this 
scheme is that the strangeness of the neutral K meson is known from the charge of 
the tagged KT or K7. 

As shown in Fig.9.10, the experiment had a cylindrical, onion-type, setup. Six 
cylindrical drift chambers, starting at a radius of 25.46 cm and going out to a radius 
of 50.95 cm, provided the main tracking information for charged particles. The 
offline track-finding efficiency was better than 99% and the wire positions were 
determined with an accuracy of about 10 um. The mean mass resolution achieved 
for the invariant kaon mass in the 7*z7 final state was 13.6 MeV/c’. By applying 
kinematically and/or geometrically constrained fits, the K? momentum resolution 
Op,/ Pt Was improved from 5.5% to 0.25%. 

The tracking detectors were followed by the particle identification detector 
(PID), used for charged-kaon identification and e/z separation. It was located at 
radii between 62.5 cm and 75.0 cm. and was composed of two layers of plastic 
scintillators with an 8 cm thick liquid threshold Cerenkov detector in between. 


The two charged tracks from the decays K? > ;r*z ^, z*e v, x * uv were 
reconstructed and the decay vertex was calculated. Using this vertex and the 
annihilation point in the hydrogen target, the proper time for decay of the kaon 
was obtained. 
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Fig. 9.10 CPLEAR detector 
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The electromagnetic calorimeter had the form of a barrel and was an assembly of 
18 lead plates alternating with sampling chambers. It was located at radii between 
75 cm and 100 cm. The calorimeter had a total thickness of ~6 radiation lengths, 
an energy resolution of og /E ~ 13%//E (E in GeV) and a position resolution 
of ~5 mm for the photon conversion points. The photon detection efficiency was 
(90 + 1)% for photon energies above 200 MeV. For photons with energies below 
200 MeV the efficiency dropped significantly and was only about 60% for 100 MeV 
photons. 

All subdetectors were embedded in a 3.6 m long, 2 m diameter solenoid magnet, 
which provided a 0.44 T uniform field. 


9.5.5 Elucidation of CP Violation in K? Decays (I): Search for 
Sm (e’ /e) 


9.5.5.1 The Significance of the Phase 6... 


The phase of n+_, +- = arg(n, .), was a possible clue that would help to 
disentangle the two components of CP violation, since 


ny =erte’. (9.128) 
If £’, the parameter of direct CP violation, was comparable in size to £, and if its 
phase was orthogonal to £, then the phase of 74— would deviate in a detectable way 


from the phase of ¢, which is mainly determined by the experimentally measurable 
values of Am and Is = 1/Ts (see Sect. 9.5.2): 


2^ 
arg(€é) = arctan (=) + p = w + Op (9.129) 
S 


With the present values of Am and Is (Sect. 9.5.5.7), 6,,, = (43.4 + 0.1)°. 

A significant deviation of the measured value of ;_ from ®,,, would be 
evidence for a nonvanishing component 3m/(e’/e) and against the superweak model 
of CP violation. 


9.5.5.2 Measurements of the Phase 9... in Interference Experiments 
Behind a Regenerator 


The relative phase between the two amplitudes of the decays Ki, — mtx7 and 
Ks — mtr" has been measured by two distinct methods. 

The first consists of measuring the interference of the KL —^ z ^x amplitude 
with the coherently regenerated Ks — z^; amplitude behind a slab of material 
(the regenerator). The experiments require (a) the measurement of the z'^z 
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Fig. 9.11 z decay distribution of K — r^ decays in the regenerator beam, for the restricted 
momentum range 40-50 GeV/c. The Monte Carlo prediction (dashed line) is without the 
interference term that is proportional to *2|o||n|" (KTeV experiment [180]) 


intensity as a function of the K? eigentime behind the regenerator, which is given in 
Sect. 9.5.3.1, and (b) an independent determination of the regeneration phase. 

The latest and most precise experiment of this type has been done by 
the kTeV collaboration [180] (see also [176]) in an experiment at Fermilab 
with an 800 GeV/c proton beam. The detector is described in Sect. 9.5.4.6; 
5 x 10? events were recorded in 1996-1997, among those were about 
9 x 10° K? — mtr” events. Their z decay distribution in the restricted 
kaon momentum interval from 40 to 50 GeV/c is shown in Fig.9.11. The 
corresponding decay time distribution was fitted with the formula (9.116). The 
nuclear regeneration amplitude F — i(f(0) — FO) /k was assumed to decrease 
with the kaon momentum p according to a power law F(p) = F(70 GeV/c)x 
(p/10 GeV/c)“. This was motivated by a Regge model where the difference between 
the K and K scattering amplitudes would be described by one single œ meson 
exchange trajectory. In this model, the phase of the regeneration amplitude is given 
by @- = —(r/2)(1 + a). In the fit, 6,_, Am, I's anda were free parameters, and 
Gy was assumed to be given by the Regge model. The results are 


P4 = 44.12? + 0.72? (stat) + 1.20? (syst) = 44.12? + 1.40? , (9.130) 
Am = (5288 + 23) x 10°s7! , (9.131) 
ts = (89.58 + 0.08 (stat)) x 107? s, (9.132) 
x? = 223.6 for 197 degrees of freedom . (9.133) 


The systematic error in 4... includes a 0.25? uncertainty from the fact that the 
relation between the regeneration phase and the momentum dependence of the 
regeneration amplitude through a dispersion relation integral is incomplete. It has 
been argued [178] that this uncertainty is larger, more than one degree, because of 
the limited momentum range in which the regeneration amplitude was measured. 
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9.5.5.3 Measurements of 9... in Vacuum Interference Experiments 


The other method for measuring 9... is the vacuum interference method mentioned 
above (Sect. 9.5.3.1), where one observes the K — mtx” distribution obtained 
from an initially pure strangeness state. The information on 9... is contained in the 
interference term proportional to cos(Am t — .,. ), and the time at which the two 
interfering amplitudes are equal is ^ 1271s, so that the correlation of ®4— with Am 
is rather strong. 

Three experiments of this type have been done in the intermediate-energy domain 
[166, 182, 183]. An analysis of the latest and most precise of those has been 
performed by the CERN-Heidelberg group [166]. The apparatus was situated in 
a 75 mrad short neutral beam derived from 24 GeV/c protons. The time distribution 
of 6 x 106 Ks, — zt *z- decays is shown in Fig. 9.12: (curve a), together with the 
fitted time distribution, as given in Sect. 9.5.3.1. The result of this fit is 


(Am — 0.540 x 1019 s7!) 


$,. = (49.4? + 1.0?) + 305? ; (9.134) 
Am 
Iņ+-| = (2.30 0.035) x 1073 , (9.135) 
Ts = (1.119 + 0.006) x 10719 s7! , (9.136) 
X^ =421 for 444 degrees of freedom . (9.137) 
(ipm $ 
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Fig. 9.12 Time distribution of K — ^; events from a coherent mixture of Kj, and Ks 
produced in pure strangeness states [166]. Curve a: events (histogram) and fitted distribution 
(dots). Curve b: events corrected for detection efficiency (histogram), and fitted distribution with 
interference term (dots) and without interference term (curve). Inset: interference term as extracted 
from data (dots) and fitted term (line). (CERN-Heidelberg experiment [166]) 
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In the high-energy domain, this method has also been used by the NA31 
collaboration [184]. In a 3.6 mrad neutral beam derived from 450 GeV protons, 
kaons of mean momentum around 100 GeV were allowed to decay. Two different 
target positions were chosen, at distances of 48 m and 33.6 m from the defining 
collimator of the neutral beam, which marked the upstream end of the decay 
volume. Kaons that decayed along 50 m in an evacuated tank were detected 
further downstream at about 120 m from the final collimator by the NA31 detector 
(Sect. 9.5.4.4, Fig. 9.7). The measured time distribution of st z^ decays is similar 
to the one in Fig. 9.12. There are 2.24 x 10° and 0.57 x 106 zr +z~ events in the data 
for the target in the near and far positions, respectively; the corresponding numbers 
of zt 9x? events are 1.81 x 10° and 0.31 x 10°. The phases were extracted from a 
fit to the time distribution of the ratio of the data in the near and far positions of the 
target. 

The results are 


(Am — 0.5351 x 1010 s7!) 
Am 
(rs — 0.8922 x 10-719 s) 


+ 270° ———————————— (9.138) 
TS 


4,- = (46.9? + 1.4?) + 310° 


and 


(Am — 0.5351 x 10!°s~!) 
Am 


— 0.8922 x 10-1? | 
pog 9 — UBSHAX TO 9 — audi 
US 


Poo = (47.1? + 2.1?) + 310? 


The difference between the two phases comes out to be 


Go — 0, = 0.2? +2.6° 12? . (9.140) 


9.5.5.4 Measurements of the Phase Difference oo — o, . 


For small |e’/é|, this phase difference is related to &'/& by the equation 
e 
oo — 4- = —3 3m (=) : (9.141) 
E 


In this way, the component of e&' orthogonal to the direction of £ can be measured. 
In the absence of CPT violation and for small |e’/e|, both of the phases 9o and 
@,_ are close to the superweak phase ®,,, (9.102). 

The measurement of this phase difference by the NA31 experiment (Sect. 9.5.5.3) 
was improved by the simultaneous measurement of the time distributions of 2 * zr - 
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and z 95? decays behind a regenerator in the KTeV experiment [180] (Sect. 9.5.5.2). 
Here, the uncertainty arising from the phase of coherent regeneration (which is the 
determining uncertainty in the ;_ measurement) cancels in the comparison of the 
two decay modes. The authors of [180] conclude that 


Poo — P4- = (0.39 £0.50)° . (9.142) 


Together with an earlier measurement by E731/E773 [179] and the NA31 measure- 
ment (9.140), this gives 


Poo — +- = (0.36  0.43)° . (9.143) 


9.5.5.5 Measurement of &,_ from a Tagged Pure Strangeness State 


The CPLEAR experiment (Sect. 9.5.4.7) offers the unique feature of tagging a 
pure neutral strangeness state K? or KÜ produced in a pp annihilation at rest 
by identifying a charged kaon produced in the same reaction. Compared with 
the vacuum interference experiments (Sect. 9.5.5.3), this offers the advantage of 
a full-size interference term, whereas in the vacuum interference experiments, an 
incoherent mixture of (predominantly) K? and K? forms the initial state, and the 
interference term is diluted. 

The interference term is visualized by measuring the decay-rate asymmetry for 


decays into zt * zt, 


Ay_(t) = AVS) (9.144) 
N(t) + N(x) 


28tee — 2|n..|]e 0/2 5* cos(Amt — @4_). — (9.145) 


In the corresponding experimental distribution, the background was subtracted 
and the events were appropriately weighted taking into account the tagging efficien- 
cies for K? and K®. The result is shown in Fig. 9.13 [181]. The result of a fit to these 
data gives values for 4... and |7n+~|, which are correlated with the values chosen 
for Am and rs, respectively. 

The correlation parameters are 


5D, = 0.30(Am — 0.5301 x 10/9 s7!) (9.146) 
and 


8|n,—.| = 0.09(ts — 0.8934 x 10719 s) . (9.147) 
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Fig. 9.13 (a) Measured decay-rate asymmetry, A, (1); the data points include residual back- 
ground. (b) Decay-rate asymmetry, A... (1); the data points are background subtracted. In both 
cases the continuous curve is the result of the fit (CPLEAR experiment, [185]) 


For the values of Am and ts chosen by the authors of [181], the results are 


P} = 43.19? + 0.53? (stat) + 0.28? (syst) + 0.42° (Am), (9.148) 
Iny—| = [2.264 + 0.023 (stat) + 0.026 (syst)] x 107° . (9.149) 


A similar asymmetry is obtained for decays to 29°, although with less statistical 
weight. 

The following values of the CP parameters in the neutral mode have been 
extracted: 


Poo = 42.0? + 5.6? (stat) + 1.9? (syst) , (9.150) 
Inool = [2.47 + 0.31 (stat) + 0.24 (syst)] x 107? . (9.151) 


9.5.5.6 Charge Asymmetry in Semileptonic Decays 


This asymmetry ôL is the third manifestation of CP violation (9.108). This 
asymmetry measures the CP impurity of the long-lived kaon state: ôL = 2Re e(1 — 
|x|7)/ ({1 — x5», where x is the AS — AQ violation parameter. Considerable 
precision was achieved in the first ten years of experimentation after 1964: for 
the Ke3 mode, two experiments, by the Princeton group [186] and by the CERN- 
Heidelberg group [187], have been reported; for the K,,3 mode, results have been 
obtained from Stanford [189] and one from the CERN-Heidelberg group [187], and 
a Brookhaven- Yale group [190] obtained a result for a mixture of both decay modes. 
Recently, two new results on this asymmetry have been reported by the KTeV and 
NAAS collaborations. The most significant features of these results are: 
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1. Event numbers of up to 298 million events in the Ke3 mode and 15 million events 
in the K,,3 mode. 

2. An apparatus design such that the decay products (zr and electron) traverse only 
minute amounts of matter (about 0.3-0.4 g cm), thus diminishing corrections 
due to secondary interactions of these particles. 

3. The precision of the K,,3 asymmetry measurements is a factor of 4 below the 
value for Ke3, making a comparison between the two possible. Table 9.3 gives 
the results; the average ôL = (3.316 + 0.053) x 1073. 


The charge asymmetries for Ke3 and K,,5 decays are equal to within 8%: ôf / ar = 
1.04 + 0.08. Assuming the validity of the AQ = AS rule, which is supported by 
the present experiments, we obtain Re e = (1.658 + 0.026) x 1073. If we use the 
most precise tests of the AQ — AS rule by the CPLEAR experiment [185], there is 
a small correction 


Ix 
— = 0.996 + 0.012. (9.152) 
|1 — x|? 

9.5.5.7 Parameters of CP Violation in the K? System: 3m/(e’ /e) 

Kg Lifetime We take the average of the older measurements combined with the two 
most recent measurements by the kTeV collaboration, (0.8965 + 0.0007) x 10-19 
s [180], and by the NA48 collaboration, (0.89598 + 0.0007) x 10-1? s [197]. Our 


grand average is 


ts = (0.8959 + 0.0004) x 10719 s. (9.153) 


Table 9.3 Charge asymmetry measurements in Kj3 decays 


Group and reference 8 [x10?] 
Columbia [164, 192] Kes 
Columbia-Harvard-CERN Ke3 average 
[193] 

San Diego-Berkeley [194] 3.6 E 1.8 3.322 + 0.055 
Princeton [186] Kes 
CERN-Heidelberg [187] Ka 

kTeV [188] Kes 3.322 + 0.058 + 0.047 

NAAS [157] Ke3 3.317 0.070 + 0.072 
Brookhaven-Yale [190] Ke3 + Ky3 
SLAC-Berkeley [165, 195] 

Berkeley [196] 60214 K,3 average 
Stanford [189] 2.78 + 0.51 3.19 + 0.24 
CERN-Heidelberg [187] Ky 


The K,,3 and Ke3 average value is db, = (3.316 + 0.053) x 10-3 
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We use this value in the following sections. 


Mass Difference Am Combining the values from the CERN-Heidelberg exper- 
iments [173-175] with those from Fermilab E731 and E773 [176, 179], from 
CPLEAR [181, 198, 199] and from the most precise single measurement by kTeV 
[180], we obtain 


Am = (0.5286 + 0.0011) x 10/9 s! . (9.154) 


Superweak Phase From the two parameters given above, we obtain the phase of ¢ 
in the superweak model, 


2Am 
@,,, = arctan n 


) = (43.4+0.1)°. (9.155) 


Moduli of the Amplitudes y, and noo New measurements of 3, from the 
ratio of mtx decays and semileptonic decays yield precise values of y, .. We 
combine these with previous results and obtain [160—162]. 


In4—| = (2.230 + 0.006) x 1073 . (9.156) 


Absolute measurements of the amplitude noo are much less precise. A recent average 
including the CPLEAR result is 


Ino] = (2.23 £0.11) x 1073. (9.157) 


Phase ®,_ In all measurements, this phase is extracted from an interference term 
with a beat frequency Am. Taking the results from NA31 [184], E731 [176], E773 
[179], and kTeV [180], together with the pre-1975 data and the result from CPLEAR 
[181], we obtain the world average 


$,. = (43.3 40.4)° (9.158) 


using the values for Am and ts above. This result is in excellent agreement with 
the value of Py = (43.4 + 0.1)°. Since the interference experiments were 
evaluated without assuming CPT invariance, this constitutes a stringent test of CPT 
invariance. The difference is 6, — d. = —(0.1 + 0.4)°. At the same time, this 
result can again be used to constrain the component of £’ orthogonal to e: 


/ 
Sm (=) = —(1.7 47.0) x 107° . (9.159) 
€ 


Therefore, at this level of 10~ relative to the amplitude for CP violation by mixing, 
£, there is no evidence for a direct CP violation amplitude &' orthogonal to €. Our 
interest now shifts to the component of &' parallel to e, i.e. 9te(e'/e). 
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9.5.6 Elucidation of CP Violation in K? Decays (II): 
Discovery of Direct CP Violation in 9&e(e' /e) 


9.5.6.1 Significance of the Double Ratio R 

The real part of &'/e is connected with the amplitude ratios noo and n+- of CP- 
violating Kr, decays to CP-conserving Ks decays. A measurement of decay rates 
with the required precision of 107? is only possible by measuring the ratio of rates 


in the same beam in the same time interval. From the Wu- Yang triangle relations 
(9.90) noo = £ — 2&' and n+- = € + z’, we obtain 
2 
à (9.160) 


No ol 
Ne (=) = ( = 
E 6 


A measurement of the double ratio 


700 


n+- 


Ino — TEL > 219/T(Ky > rtr) 


R= = 
m- TEs > 2n)/P (Ks > atx) 


(9.161) 


to a precision of about 0.3% is therefore required to distinguish between the two 
remaining models, the KM milliweak model and the superweak model. Since the 
KM model predicts values of Ste(e’/e) in the range between 0.2 x 1073 and 
3 x 10^, the precision required for detecting a signal of direct CP violation 
depends on the actual value. If the largest prediction is realized in nature, a precision 
of SR = 5 x 10^? would be sufficient for a three-standard-deviation observation. 
If, however, the lowest value is realized, a precision of óR = 0.4 x 107? would 
be needed, corresponding to samples of several million events for each of the four 
decay modes. 


9.5.6.2 The NA31 Experiment: First Evidence for Direct CP Violation 


The first observation of direct CP violation was made by a collaboration of 
physicists at CERN in 1988 [191]. The experiment, called “North Area No. 31”, or 
NA31, was based on the concurrent detection of 27° and x*+m~ decays. Collinear 
beams of Kj, and Ks were employed alternately. The beam layout and the apparatus 
were described in Sect. 9.5.4.4. The K? — 27° — 4y decays were reconstructed 
and separated from the background primarily due to Ki, — 32° — 6y decays 
as described in Sect. 9.5.4.2. This background is uniformly distributed in a two- 
dimensional scatter plot of photon-pair masses, while the 27r? signal peaks at a point 
S where both photon pairs have the 2° mass, with a 2 MeV resolution. Signal and 
background events were counted in equal-area x? contours around S. Figure 9.14 
shows the x? distribution of events in the Ks beam and in the Kj, beam. The signal 
region was taken as x? < 9. Background in the Kj, data was subtracted by linear 
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Fig. 9.14. Number of 0° 
accepted 4y events as a 
function of x? for the 

Ks — zt? (left) and 

KL > x 959 (right) data, and 
a Monte Carlo calculation of 
the background originating 
from Ky — 37° decays 
(dotted). The signal region 
was taken as x? < 9 (NA31 
experiment [191]) 
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extrapolation into the signal region, and amounts to about 4%, while it is negligible 
in the Ks data. 

The K? — ntr” decays were reconstructed from the four space points of the 
two pion tracks in two wire chambers. From these tracks, the position of the decay 
vertex along the beam was reconstructed with a precision of better than 1 m. The 
energies E, and E» of the two pions were obtained from the energies deposited 
in the liquid-argon electromagnetic calorimeter and the iron-scintillator hadronic 
calorimeter. The K? energy was then calculated using the kaon mass and the opening 
angle 0 of the two tracks as constraints from the ratio E/E: 


A 
E, = a (m2 — Am), (9.162) 
where 
Aa eg (9.163) 
B E> Ei ` i 


Background from K? — sev (Ke3) decay was reduced by comparing, for each 
track, the energy deposited in the front half of the electromagnetic calorimeter with 
the energy deposited in the hadron calorimeter. 

After cuts on the invariant zt *;zt- mass and on the transverse location of the 
center of energy relative to the center of the neutral beam, a residual background of 
three-body decays was subtracted. 

Figure 9.15 shows the transverse distance dr between the decay plane, as 
reconstructed from the two tracks, and the K? production target, at the longitudinal 
position of the target. For Ks decays, this distributions peaks at dr = 0 with a 
resolution given by the measurement error and multiple scattering. For Ky, decays, 
in addition to this component of two-body decays from the target, there is a broader 
distribution mixed in due to three-body decays. The signal region was taken to be 
dy « 5 cm, and the three-body background was extrapolated from a control region 
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Fig. 9.15 Event distribution for charged decays as a function of the distance dT (in cm) between 
the decay plane and the production target for Ks (left) and Ky, (right) decays, and for various 
background components (right) (NA31 experiment [191 ]) 


7 « dy « 12 cm. This background amounts to (6.5 + 2.0) x 107? of the signal, 
including systematic uncertainties. 

The remaining event sample contained 109 x 10? of K > z 959, 295 x 10? 
of KL > xt”, 932 x 10? of Ks — 297°, and 2300 x 10? of Ks > ata. 
In order to equalize the acceptance for Ks decays (with an average decay length 
of 6 m) to that for the uniformly distributed Kj, decays, the Ks data were taken 
with the Ks target displaced in 1.2 m steps over 48 m such that the distribution 
of Ks decays became effectively uniform in the fiducial region. This makes the 
double ratio essentially insensitive to acceptance corrections. The double ratio was 
evaluated in 10 x 32 bins in energy and vertex position. The weighted average, after 
all corrections, is 


R = 0.980 + 0.004 (stat) + 0.005 (syst) . (9.164) 


The corresponding result for the direct-CP-violation parameter is [191] 


/ 
Ne (=) = (33.0 + 11.0) x 1074. (9.165) 
€ 


This, with three-standard-deviation significance, shows that the CP-odd K» decays 
to two pions, and was the first evidence of direct CP violation. 

In further measurements with the NA31 detector, the event numbers recorded 
were considerably increased, by a factor of four, thus decreasing the statistical error. 
In addition, the background from Ke3 decays was reduced by introducing a two- 
stage transition radiation detector as an additional identifier for electrons. With these 
improved data, the double ratio was measured with reduced uncertainty. Including 
the former result, the double ratio obtained is 


R = 0.982 + 0.0039 , (9.166) 
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leading to a value 
/ 
Ne (=) = (23.0+6.5) x 107^, (9.167) 


or 3.5 standard deviations from zero. 


9.5.6.3 The Experiment E731 at Fermilab 


In this experiment [177] charged and neutral decays were registered in separate runs 
with a slightly different detector. On the other hand, in E731 Ky, and Ks decays were 
collected simultaneously using a split beam: in one half of the beam cross section, 
Ky mesons from the target were allowed to decay over a long decay region of 27 m 
(charged decays) or 42 m (neutral decays), while in the other half, the Kj, beam hit 
a block of B4C, whereby a beam of Ks mesons was regenerated. 

The vertex distribution of the events from regenerated Ks was concentrated in 
a small region behind the regenerator, positioned at 123 m from the target, owing 
to the typical Ks decay length of 5 m. On the other hand, the vertex distribution of 
KL — zz? decays extended from 110 to 152 m from the target. The detector 
acceptances for Kj, and Ks decays therefore were very different, and since the 
decay volumes for Kj, > ata and Ky, > z?z were different, this acceptance 
correction does not cancel in the double ratio. 

The z * x decays were selected by requiring the invariant 27 mass to be near 
the kaon mass and the transverse kaon momentum to satisfy p? < 250 MeV?/c?. 
The background from incoherent kaon regeneration amounted to (0.155 +0.014)%. 
The extrapolated 32° background under the peak is 1.78% and 0.049% in the 
vacuum and regenerator beams, respectively. Neutral background from incoherent 
scattering in the regenerator was subtracted by evaluating the distribution of the 
transverse center of energy of each event around the center of each beam. After this 
background subtraction, the event numbers in the vacuum beams were 327 x 10° 
(x+m~) and 410 x 10? (x?z9). In the regenerator beams, there were 1.06 x 106 
7*7 events and 0.800 x 106 27? events. The regeneration amplitude was assumed 
to fall in accordance with a power of the kaon momentum, p^. In the fit, the 
parameter o, the regeneration amplitude at 70 GeV/c momentum, and 3te(e'/&) 
were varied. The results were 


a = —0.6025 + 0.0065 (9.168) 


and 


f 
Re (=) = (7.4 5.2 (stat) + 2.9 (syst)) x 1074 , (9.169) 
E 
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where the systematic uncertainty includes a part from acceptance calculations 
(1.19 x 1074) and from the energy calibration (1.6 x 107^). The authors of [177] 
deduce an upper limit Nte(e’/e) < 17 x 107^ (95% C.L.), which is at variance with 
the observation of the NA31 experiment. 


9.5.6.4 The kTeV Experiment at Fermilab 


The disagreement between the positive result of NA31 and the null result of E731 
left an unsatisfactory state of affairs. For this reason, new experiments with a tenfold 
increase in data-taking capacity and reduced systematic uncertainty were designed, 
at both Fermilab and CERN. 

The Fermilab experiment at the 800 GeV Tevatron, called kTeV, was described 
in Sect. 9.5.4.6 [180]. The main improvements compared with E731 were: 


— all four decay modes were measured concurrently; 

— the electromagnetic calorimeter was made of CsI, with much improved energy 
resolution; 

— the regenerator at 123 m from the target was made of plastic scintillator, viewed 
by photomultipliers such that inelastic regeneration could be detected by the 
recoiling nucleus; 

— the kaon momentum range from 40 to 160 GeV/c and the decay vertex region 
from 110 m to 158 m from the target were the same for all decay modes. 


As in the E731 experiment, a double beam of Kj, and regenerated Ks entered 
the decay volume. K? — ;r^z- decays were identified by their invariant mass. 
Semileptonic Ke3 events were reduced by a factor of 1000 by requiring the ratio 
of the calorimetric energy E of a track to its momentum p to be less than 0.85. 
K,3 events were rejected by registering the muon penetrating the 4 m iron wall. 
The invariant-z zt -mass shows a rms mass resolution of 1.6 MeV, and events in the 
range 488—508 MeV were selected. Background from Ks produced in incoherent 
regeneration was suppressed mainly by vetoing events with a signal generated in the 
active regenerator indicating the recoil of a nucleus in the scattering process. Further 
reduction of this background was achieved by extrapolating the kaon direction back 
to the regenerator exit face and calculating the transverse momentum of the kaon 
relative to the line connecting this intercept with the target position. After a cut 
against the backgrounds from semileptonic decays and from collimator scattering, 
11.1 million and 19.29 million mtm events remain in the vacuum beam and 
regenerator beam samples, respectively. 

The selection of 27° events follows the lines described in Sect. 9.5.4.2. Events 
with an invariant zz? mass between 490 and 505 MeV were selected. Events in 
which a kaon scatters in the collimator or the regenerator were reduced by a cut 
in the ring number (RING), defined by the maximum deviation Axoe or Aycoe (in 
cm) of the center of energy of all showers from the center of the corresponding beam 
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spot at the CsI position, to which the event was assigned by use of the x-position of 
the center of energy: 


RING = 4 x Max(Ax2,,, Ay2,,) . (9.170) 


The signal was selected by the cut RING < 110 cm’. The largest background 
comes from regenerator scattering in the regenerator beam, 1.13%, adding up to 
a total of 1.235% in that beam. Also, in the vacuum beam, the events scattered 
in the nearby regenerator make the largest contribution to the background, 0.25%, 
which is 0.48% in total. After all cuts and background subtraction, the remaining 
signal consists of 3.3 million and 5.55 million events in the vacuum and regenerator 
beams, respectively. The 3.3 million Ki, — zx? events are the limiting factor in 
the statistical uncertainty in the double ratio. 

Since the vertex distributions of Kr, decays (flat) and Ks decays (concentrated 
behind the regenerator) are very different, the raw double ratio has to be corrected 
by the double ratio of acceptances (Fig. 9.16). The quality of the Monte Carlo simu- 
lations for the acceptances was checked by reproducing the z vertex distributions 
of the vacuum beam data for different decay modes (Fig.9.17). In general the 
agreement is good, except that the mtm data show a slope of (—0.70 + 0.30) x 
107^ /m. 

The result for Re(e’/e) is [180] 


1 
Re (=) = (20.71 + 1.48 (stat) + 2.39 (syst) x 1074 = (20.7 + 2.8) x 107*. 
€ 
(9.171) 


The systematic uncertainty for the neutral decays is mainly due to background, CsI 
energy calibration and acceptance corrections; for the charged decays, it is mainly 
due to uncertainties in the acceptance and trigger efficiency. 

The acceptance correction that has to be applied is about 5 x 107? for R, or 
~ 80 x 107^ for 3te(c'/&), four times larger than the signal. 


9.5.6.5 The NA48 Experiment 


When the NA31 observation of a nonvanishing Ste(e’/e) was not confirmed by the 
result from the E731 experiment, the CERN-based collaboration set out to construct 
a new, improved detector with the goal of achieving a precision measurement of 
3te(e'/&) with a total uncertainty of 0.2 x 1074. 

The new experiment was designed 


— to measure all four decay modes concurrently by using two incident proton 
beams; 

— to improve on neutral-background rejection by developing a liquid-krypton 
electromagnetic calorimeter with substantially better energy resolution; 
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Fig. 9.16 (a) z vertex distribution for reconstructed K — 2+z~ decays for the vacuum beam 
(thick) and regenerator beam (thin histogram). (b) z vertex distribution for reconstructed K — 
nn decays. (c) Kaon momentum distribution for reconstructed K — +7 decays. (d) Kaon 
momentum distribution for reconstructed K — zx? decays. All K — mz analysis cuts have 
been applied and background has been subtracted (see [180]) 


— to improve on charged-background rejection by using a magnetic spectrometer. 


The resulting beam and detector have been described in Sect.9.5.4.5. Data 
were taken in 1997, 1998 and 1999 with 450 GeV protons. In the design of the 
NAAS detector, the cancellation of systematic uncertainties in the double ratio was 
exploited as much as possible [201]. Important properties of the experiment are 


— two almost collinear beams, which lead to almost identical illumination of the 
detector, and 
— the Ks lifetime weighting of the events defined as Ky, events. 


The Kj, target was located 126 m upstream of the beginning of the decay region, 
and the Ks target 6 m upstream of the decay region. The beginning of the Ks decay 
region was defined by an anti-counter used to veto kaon decays occurring upstream 
of the counter and defining the global kaon energy scale. 

The identification of Ks decays was done by a detector (tagger) consisting of an 
array of scintillators situated in the proton beam directed on to the Ks target. To 
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Fig. 9.17 (a) Comparison of the vacuum beam z distributions for data (dots) and MC calculations 
(histograms). The data-to-MC ratios (b) have been fitted to a line, and the z slopes are shown. The 
neutral distributions are for the combined 1996 + 1997 samples; the charged distributions are for 
1997 only (KTeV experiment [180]) 


identify events coming from the Ks target a coincidence window of +2 ns between 
the proton signal in the tagger and the event time was chosen (see Fig. 9.18). Owing 
to inefficiencies in the tagger and in the proton reconstruction, a fraction o, of true 
Ks events are misidentified as Kr, events. On the other hand, there is a constant 
background of protons in the tagger which have not led to a good Ks event. If those 
protons accidentally coincide with a true Ky, event, this event is misidentified as a 
Ks decay. This fraction œ, depends only on the proton rate in the tagger and the 
width of the coincidence window. 
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Fig. 9.18 (a), (c) Minimal difference between tagger time and event time (Afmin). (b) Atmin for 
charged K; and Kg events. (d) Comparison between charged and neutral event times. For this 
measurement, decays with tracks selected by the neutral trigger were used (y conversion and Dalitz 
decays Ks — none — yyye'* e ) (NA48 experiment [201 ]) 


Both effects, at and Cages can be measured (see Fig.9.18b) in the charged 
mode, as Ks and Kj, events can be distinguished by the vertical position of the decay 
vertex. The results are at = (1.63 0.03) x 1074 for the data from 1998/1999 


and (1.12 + 0.03) x 1074 for the data from 2001. For the accidental-tagging rate, 
the value measured was WT = (10.649 + 0.008)% for the 1998/1999 data sample 
and (8.115 + 0.010)% for the 2001 sample, owing to the lower instantaneous beam 
intensity. This means that about 11% or 8% of true Kr, events are misidentified as Ks 
events; however, this quantity is precisely measured to the 1074 level here. For the 
measurement of R the difference between the charged and the neutral decay modes, 


Aa, = a? — awt-, is important. Proton rates in the sidebands of the tagging 


window Were measured in both modes to determine Ag, s. The result is Agis = 
(4.3 + 1.8) x 1074 for the 1998/1999 event sample and (3.4 + 1.4) x 1074 for 
the 2001 event sample. Several methods have been used to measure Ao, , leading 
to the conclusion that there is no measurable difference between the mistaggings 


measured by different methods within an uncertainty of +0.5 x 1074. 
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Another important correction is the background subtraction. Decays of the types 
K — sev and Ky — zr uv can be misidentified as K > at27 decays, as the v 
is undetectable. These events were identified by their high transverse momentum pi 
and their reconstructed invariant mass. The remaining background can be measured 
by extrapolating the shape of the background in the pe distribution into the signal 
region (Fig. 9.19). In this way, the charged background fraction leads to an overall 
correction to R of (16.9+3.0) x 1074 for the 1998/1999 data and (14.2+3.0) x 1074 
for the 2001 sample. 

The reconstruction of zz? decays followed the principles described in 
Sect. 9.5.4.2. The two yy masses for the best pairing are anticorrelated because 
of the constraint of the kaon mass (Fig. 9.20). The ellipses in Fig. 9.20 designate 
contours with increments of one standard deviation. The background from 37° 


0 
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Table 9.4 Event numbers of the NA48 experiment after tagging correction and background 
subtraction [201, 202] 


Event statistics [x 109] 


1998/1999 2001 [1998/1999  |2001 
Ks > ata 22.221 9.605 K > atx 14.453 7.136 
Ks > z 92? 5.209 2.159 Kp > z 5? 3.290 1.546 


decays with two undetected photons is distributed with constant probability over 
each ellipse, as shown by Monte Carlo calculations. This leads to a correction to R 
of (—5.9 + 2.0) x 1074 for the 1998/1999 sample and (—5.6 + 2.0) x 1074 for the 
2001 sample. 

The numbers of signal events after these corrections are summarized in Table 9.4. 

The efficiency of the triggers used to record neutral and charged events has been 
determined. In the neutral decay mode the efficiency was measured to be 0.99920 + 
0.00009, without any measurable difference between Ks and Kj, decays. The x ^ ^ 
trigger efficiency was measured to be (98.319 + 0.038)% for Ky, and (98.353 + 
0.022)% for Ks decays. Here, a small difference between the trigger efficiencies 
for Ks and Kj, decays was found. This leads to a correction to the double ratio of 
(—4.5 + 4.7) x 1074 for the 1998/1999 sample and (5.2 + 3.6) x 1074 for the 2001 
sample. 

Other systematic uncertainties include the limited knowledge of the energy scale, 
nonlinearities in the calorimeter, and small acceptance corrections. 

Summing all corrections to and systematic uncertainties in R, the authors find the 
amount to (35.9 + 12.6) x 1074 for the 1998/1999 data and (35.0 + 11.0) x 107^ 
for the 2001 data. 

The corresponding result for the direct-CP-violation parameter is 


£ 
Re (<) = (15.3 + 2.6) x 10-4 (9.172) 
€ 


for the data from 1997 [203] and 1998/1999 [201], and 


rd 
Re (=) =(13723.) x 1074 (9.173) 
€ 


for the data from 2001 [202]. 

A comparison of these two values is significant because they were obtained 
at different average beam intensities. The combined final result from the NA48 
experiment is 


Re (>) = (14.741.4 (stat)+0.9 (syst)£1.5 (MC)) x 107^ = (14.74+2.2)x 1074 . 
E 
(9.174) 
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9.5.6.6 Conclusions About Direct CP Violation, Ste(e’ /e) and the 
Wu-Yang Triangle 


The two experiments kTeV and NA48 have definitively confirmed the original 
observation of the NA31 team that direct CP violation exists. The results of all 
published experiments on €'/e are shown in Fig.9.21. Therefore, CP violation as 
Observed in the K meson system is a part of the weak interaction due to weak quark 
mixing. Exotic, new interactions such as the superweak interaction are not needed. 
We therefore have a very precise experimental result for &'/e. The theoretical 
calculations of &'/& within the Standard Model, however, are still not very precise. 
This does not change the main conclusion of the experiments that &' is different 
from zero and positive, i.e. direct CP violation exists. 

If we take into account the four relevant experiments NA31, E731, NA48, and 
kTeV, the weighted average comes out to be 


/ 
Re (=) = (16.7 + 1.6) x 107^. (9.175) 
€ 


The consistency of the result is not completely satisfactory, since x?/ndf = 6.3/3. 
If the phase of &' as defined in (9.92), arg e’ = (42.3 + 1.5)? [149], is inferred, then 
a more precise value for the component of &' transverse to £ can be derived. 

We have done a complete fit to the Wu—Yang triangle (9.90), using as input 


$ 
Re (2) = (16.7 + 1.6) x 107^, Nee = (1.658 + 0.0265) x 1073 , 
€ 


$, = 43.3°+0.4°, Dog — +- = (0.36 + 0.43)° , 


[na—| = (2.230 + 0.006) x 1073, noo] = (2.225 + 0.007) x 1073 , 


arg £ = 43.4 + 0.1°, arge' = 42.3° + 1.5°. 
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Fig. 9.22 Result of the fit to the Wu-Yang triangle relations between 7+—, noo, € and z’. (a) 
Correlation of Stee and Sm e ; (b) correlation of Ne(e’ /e) and Sm(e'/&). The boundaries of the 
black areas correspond to one-standard-deviation uncertainties 


The result of this fit is, with x?/ndf — 3/4 


/ 
em (=) = (-3.2+4.4) x 1075 (9.176) 
€ 


and 


Sm e = (1.530 + 0.005) x 10? . (9.177) 


Also, the real part of £ is constrained by the fit: 


Ree = (1.619 + 0.005) x 10? 


The result of this fit is expressed in correlation plots for Ree and 3m £, and for 
9te(e'/&) and Sm(e'/&), in Fig. 9.22. 

The result for direct CP violation (9.175) can also be quoted as a decay 
asymmetry between the K^ and K? decay rates to a m"x" final state. If the 
amplitudes are called a = amp(K® — ;^z-) anda = amp(K? > ntr”), 
then this asymmetry is 


0 DR? > ata) - PK > rtr)  |al-[ap 


A = — = a Lp- 
rR? > wta-) + rR? > rtr) la|? +a? 


(9.178) 
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Since a/a = 1 — 2e’, we obtain 


—-2 

-| 21-—43Sez' (9.179) 

a 

and 
& & 
A — 28tee' 22 (ve: Re (=) — 3m e $m (=)) = (5.5 + 0.6) x 1076. 

E E 

(9.180) 


This very small decay rate asymmetry can be compared with the large values of 
some similar observables in the B system. 


9.6 The Neutral B Meson System 


The KM model today often also referred as CKM! mechanism of the Standard 
Model predicts direct CP violation for the neutral K meson system and it was 
thus a strong support for the theory when a finite value of Nte(e’/e) was observed. 
The ultimate test of the quark-mixing paradigm however was the precise study of 
CP violation in the neutral B meson system for which the theory predicted large 
CP violation. The discovery of B? — B? mixing in 1987 [58] made the B? system 
the prime candidate for observing CP violation in a physical system different from 
the neutral kaon. The observation of large time-dependent C? asymmetries in B? 
decays in 2001 by the B-factories [59, 60] allowed the determination of the complex 
phase à of the CKM matrix and as a consequence provided a first sensitive test of the 
unitarity of the quark mixing matrix. With the observation of the B? meson mixing in 
2006 [61] a second B meson system became available for precise mixing studies and 
studies of CP violating effects. With the start of the LHCb experiment [62] the focus 
of the B meson studies have changed. Precision measurements no longer aim to 
prove the CKM paradigm but look for small deviations from the theory predictions 
as possible smoking-guns for physics beyond the description of the Standard Model. 


IN, Cabibbo introduced the concept of quark mixing for two quark generations [56]. It was 
extended to more generations by M. Kobayashi und T. Maskawa who also explained CP violation 
in case of three and more generations [57]. The corresponding 3 x 3 quark mixing matrix is today 
referred as KM or CKM matrix. In the remaining part of this chapter the term CKM matrix is used. 
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9.6.1 Phenomenology of Mixing in the Neutral B Meson 
System 


The parameters of mixing in the two neutral B mesons—the B? with quark content 
(bd) and the B? with quark content (bs), in the following they are called both 
generically B°—are very different from those observed in the neutral K meson 
system. In Sect. 9.3 the complex decay parameters yp and y; are defined for the 
heavy (h) and light (7) meson states (see Eq. (9.41)): 


. IN 
Ya = my + y, 

' Ti 
= i 


With these parameters the time evolution of the neutral B states can be written as 
(see Eqs. (9.34) and (9.40)) 


IB) = (p |B?) — a [Be . 


BO) = (p |p?) + a [Bo pen (9.181) 


The two initial states at t = 0, with a definite quantum number B , are 


Y (0) = B^ and y- (0) = B9 . (9.182) 


Their decay law is not any longer exponential and results to a finite probability P 
for a flavor change given by Eqs. (9.44) and (9.45). In the Standard Model, the CP 
violation in mixing of neutral B mesons is expected to be very small, such that 
Ig/p| = 1 within O(107^) and ©(1075) for the B? and the B? system respectively 


[63]. In this approximation the mixing probabilities for the two B^ and B are equal, 
ie. 


P(B? > B?) = P(BO > B9), (9.183) 
and Eq. (9.44) can be written as 


0 . ZG l r l nm ATt 
P(B” > B®) = p? (cosh yT — cosxT) — 3" cosh c E cos(Amt)] . 
(9.184) 
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Fig. 9.23 Feynman diagrams of the B? mixing 


Correspondingly, the probability for remaining in the original beauty state is, 
0 0 TUBE TT 1 nm ATt 
P(B’ => B) = P(B} => BY) = 5° ru +cos(Amt)). (9.185) 


One thus obtains for the flavor asymmetry at time ¢ of an initially pure flavor state 
(see Eq. (9.50)): 


P(B? —> B?) — P(B? > B®)  cos(AmT) 


A(t) = — s 
P(B? — BO) + PB? > B9) cosh 4+ 


(9.186) 


The mixing parameter x is determined through short-range interactions given by 
box diagrams of the type shown in Fig. 9.23. For the B mixing the diagrams with a 
virtual ¢ quark dominate and one obtains [63, 65] for the mixing parameter x, 


Am G; 2 x 2:442. m; 
X= TF = gaz Om Fog! Bg Tg, | Vi Via] MwF M, Noco (9.187) 


with index q = d(s) for the p? (B9) respectively. Here, ny and Ty, are the mass 
and the lifetime of the neutral B meson, Bg, is the bag factor parametrizing the 
probability that the d (s) and the b quarks will form a p? (B9 hadron, SB, is the 
B meson decay constant, F is the calculated loop function, increasing with the top 
quark mass squared m?, and noc, © 0.8 is a QCD correction. The parameters Gp 
and Mw are the Fermi coupling constant and the mass of the W boson and V;; are 
the CKM matrix elements (see Sect. 9.7). 

For the B? meson the lifetimes of the heavy and light state are approximately 
equal, In = I; = P , and y = AT/2r ~ 0. In this case the flavor asymmetry 
simplifies to A(t) = cos (Amt). For the B? meson a significant lifetime difference 
between the heavy and light states arises due to different decay channels as result of 
the different CP eigenvalues of the two mass states. The lifetime or width difference 
can thus be measured using final states with defined CP values.? From recent theory 
calculations one expects for the B? meson AT /T ~ 13% [64, 66]. It should further 


?This is strictly true only if one ignores the very small CP violation in mixing. 
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be noted that applying the definition of Sect. 9.3 for AT leads to a negative width 
differences for the B? mesons. It is therefore common practice [67] to redefine AT 
to be positive, 


AT =T;—-Ty,. (9.188) 


To test the approximation |g/p| 7 1, ie. the assumption of vanishing CP 
violation in the mixing, 


P(B? > B®) — P(B? > BO) 1—|q/pl* - 
PB? > B?) + P(B? > BO)  1-|g/pl" 


amix = 


one can measure the semi-leptonic CP asymmetry for the two B? species (see 
Eq. (9.56)), 


T(B9(r) > £^vX) — T(BU(tr) > £7vX) 
m dii Anda. NN (9.189) 
T(B9(r) > £^vX) + T(GB9(t) > £-vX) 


where the detection of a B? decaying into a wrong-sign muon B? — B} — £^vX 
indicates mixing. 

The measurement of the time-dependent CP asymmetry between the decays of a 
B? and a BÓ to a common final state f probes the CP violation in the interference 
between the decay with and without mixing. Following Eq. (9.51) one introduces 
the decay amplitudes 


Aş = (FITIB) ,— Ag =(fITIB), 


Aj = (FITIB) , Az — (f|T|BO) . (9.190) 


The theoretical description simplifies if decays to CP eigenstates B > fc p are 
used. These final states fulfill CP | fc p) = | fc») = ncp |fcp) with ncp = +1 and 


Ay = ncpAj. 
Using Eq. (9.42), one obtains the decay rates of initially pure B? and B? states, 


2 
WO Henriette 
p 
dN( 1 = 
“i = " (e! pe )A,— P (emt — e "A; (9.191) 
fU 
The time dependent CP asymmetry is defined as 
dN — dN 
acp (t) = (9.192) 


dN +daN ` 
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Neglecting the very small deviation of |q / p| from unity (i.e. CP violation in mixing) 
and using the ratio defined in Eq. (9.53), 


Af 
po. (9.193) 
pAry 
one obtains for ac p (t) [68], 
ast AG’, cos(Amt) + AP? sin(Amt) (9.194) 
cosh(4-r) + Aar sinh( Ar) 
with 
2 
, 1-] " 28A; 290. 
Adi, = spun. A , Amis Sf . Aapm-———-. (9.195) 
1+ p^. 1+ [^s] 1+ A] 


The first term accounts for possible direct CP violation while the other two terms 
encode properties of the mixing. Experimentally, by measuring the time dependent 
CP violation ac p(t) one can determine the coefficients of cos(Amt) and sin(Amt) 
and thus determine lA; | and SA y. In general, non-perturbative QCD effects prevent 
to relate these quantities to CP phases originating from the quark mixing. However, 
in case of so called golden modes which are dominated by a single decay amplitude 
and thus a single combination Vcxm of CKM elements such an association is 
possible. Since the strong interaction respects the CP symmetry, golden modes 
fulfill: 


Af Aj Vorn 
Af Af Verm 
From this equation one sees that [Az] = lA y| and that assuming |g/p| = 1, one 


obtains ls] — ]. Thus golden modes satisfy 
AT, L0, APILIC-SA,. (9.197) 


For golden decays of B? mesons for which the decay width difference AT is 
negligible, expression (9.194) further simplifies to 


ac p(t) = —SAy sin(Amt) . (9.198) 


Thus, if 4 carries a non-trivial weak phase $ (SA ¢ # 0) the time dependent CP 
asymmetry will show a sinusoidal time behaviour with an amplitude given by sin $. 
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9.6.2 Production and Detection of B-Mesons 


The precise measurement of oscillation and C7 violation in the neutral B meson 
system requires a high number of produced B mesons. A good reconstruction of 
the B meson decay vertex, necessary for an excellent decay time resolution, is 
needed to resolve the time-dependent effects. To perform the CP measurements 
the determination of the production flavor of the B meson, often referred as flavor- 
tagging, is necessary. 


9.6.2.1 e*e- B-Factories 


The first experiments which have systematically addressed these requirements are 
the BABAR experiment [69] at the et e^ collider PEP-II at eae and the Belle 
experiment [70] at the Japanese ete~ collider KEKB. Both ete~ machines were 
operating at a centre-of-mass energy of ./s = 10.58 GeV, corresponding to the 
mass of the Y (4S), an excited bb resonance which decays to BÜpo (~ 50%) and 
B*B- (~50%). The cross section for the Y (4S) production is 1.1 nb which is about 
a quarter of the total hadronic cross section. 

BABAR was operated from 1999 to 2008 and collected a data set corresponding 
to an integrated luminosity of 550 fb! . The Belle experiment has also been started 
in 1999 and was taking data until the end of 2009, collecting about 1 ab^! of data. 
Due to the large number of B mesons the two experiments have recorded, they are 
often referred as B-factories.? To continue this successful path, the KEKB collider 
as well as the Belle experiment have both undergone an upgrade. The new Belle-II 
experiment has started data-taking in 2018 and will be operated at a 40 times larger 
instantaneous luminosity [72]. 

The mass of the Y (4S) lies only 11 MeV/c? above the sum of the two produced 
B mesons which would therefore be produced essentially at rest and would not fly. 
The vanishing decay length would prevent a measurement of the decay time. To 
overcome this problem both machines have been operated with slightly asymmetric 
beam energies resulting in a small boost of the ' (4S) and thus of the produced B 
mesons. The boost factor By was 0.56 (0.425) for the BABAR (Belle) experiment, 
resulting into typical decay lengths of about 250 um for the two B mesons with 
lifetimes of 1.5 ps. 

If the Y (4S) with J P= {7 decays into a pair of neutral B mesons, the two 
mesons are produced in a coherent BÓB? state with negative parity, 


B® (6))B°(62) — B°(@2)B9(01) , (9.199) 


3A comprehensive description of the two e+ e~ machines, the two detectors, the operation and the 
physics results can be found in [71]. 
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where 64.5 are the B meson production angles relative to the e* direction. The 
flavors of the two mesons are thus fully correlated until the first meson decays at 
time f. If at that time this meson is a B? the flavor of the second meson is fixed to 
be a B? and vice versa. The time evolution of the second B meson with decay time 
t2 is then given by 


W(t) — B9 (e Yn(2—1) | e-Yi (to 8 _ ^p" (e yh(t2=t1) _ g-vi(ta 0) , 
(9.200) 


i.e. the time evolution of the flavor of the second B meson is defined by the time 
difference At = t? — tı between the two decays. Depending on which of the two B 
mesons is studied, Aż can also be negative. 

The decay topology of a B°B® pair produced at t = 0 from the decay of the 
'T (4S) is illustrated in Fig. 9.24, where one of the neutral B mesons (signal B) is 
decaying at t into the golden mode J/ y Ks used to measure the time-dependent CP 
violation, and the other B meson (tagging B) decays at t, into a flavor-specific final 
state. The charge of the electron indicates the flavor of the B meson at the time of 
the decay (B). At this time the signal B was thus a B®. 

Due to the boost of the Y (4S) the different decay times result into different z- 
positions of the two decay vertices along the beam direction. The difference Az is 
about 250 um and is related to the decay time difference At: 


Az-—z52—z|— yc (h — t) = BycAt. (9.201) 


Both B-factories are very similar. Therefore only the Belle experiment at the 
KEKB collider in Tsukuba is discussed here. To produce a boost By = 0.425 of the 


Fig. 9.24 Decay topology of Bop? pairs 
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'Y (4S) in the laboratory, the KEKB electron beam has an energy of 8 GeV while the 
positrons collide with an energy of 3.5 GeV. The highest instantaneous luminosity 
achieved at KEKB was £ = 2 x 10°4s~! cm? [71]. 

The Belle detector (Fig. 9.25) followed the typical onion-shape design. After an 
exchange in 2003, a silicon vertex detector (SVD) made from four layers of double- 
sided silicon strip sensors was used as inner component. The innermost (outermost) 
SVD layer was located at a radial distance of only 20 (88) mm from the collision 
point. The impact parameter resolution for charged particle tracks obtained is [71] 


Or = [2196 m um [p in GeV/c] , 
P 


31.9 
o, = | 27.8 — | um. 
p 


The SVD was followed by the central drift chamber (CDC) which extended to 
a radial distances of 88 cm. The combined tracking system of SVD and CDC 
provided a good momentum resolution, especially for low momentum tracks, thanks 
to the minimization of material. For the transverse momentum a resolution of 
Opr/pr = 0.0019 - pr €» 0.0030/B [pr in GeV/c] was achieved [71]. Particle 
identification was provided by the time-of-flight (TOF) system and by the aerogel 


Fig. 9.25 Belle detector with the silicon vertex detector (SVD), the central drift chamber (CDC), 
the time-of-flight-system (TOF) and the aerogel Cherenkov counter (ACC), the CsI electromag- 
netic calorimeter and the K; /u detector (KLM). The detector is embedded in a magnetic field of 
1.5 T produced by a solenoid with a length of 4.4 m and a diameter of 3.4 m [70] 
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Cherenkov counter (ACC). Photon and electron energies were measured in the 
electromagnetic calorimeter (ECL) consisting of thallium-doped CsI crystals. The 
energy resolution varied from 4% at 100 MeV to about 1.6% at 8 GeV [71]. All 
detector components were placed inside a super-conducting solenoid magnet of 
cylindrical shape which was providing a magnetic field of 1.5 T. The iron return 
yoke is instrumented with resistive plate chambers (KLM) and served to identify 
muons and K, with a momentum above 600 MeV/c. 


9.6.2.2 The LHCb Experiment at the Large Hadron Collider 


The multi-purpose experiments CDF and DO [73-76] operated at the pp collider 
Tevatron at Fermilab established that—despite of the harsh and high-multiplicity 
environment of the hadron collisions—precision measurements of B mesons com- 
petitive with those performed at e* e^ B-factories are possible at hadron machines. 
Both experiments have pioneered studies of the B? system which was hardly studied 
at the e*e^ machines, as kinematically the heavier B? system is only accessible 
through the decays of the Y (55). Only the Belle collaboration collected a small 
data sample of Y (55) decays. 

At the Large Hadron Collider (LHC) at CERN, the LHCb experiment [77] 
and to a lesser extent also the ATLAS and CMS experiments [78, 79] followed 
the successful path of precision B meson studies explored at Tevatron. The cross 
section for bb pair production in the pp collision of the LHC is huge, about 500 ub 
for proton-proton center-of-mass energies of 13 TeV. As only small fractions of 
the proton energies are needed to produce the bb pair, the momentum factions 
x1 and x2 of the colliding partons are in general very different and the relatively 
light bb system is boosted in the laboratory frame into either the forward or the 
backward direction. The LHCb experiment has therefore been designed as a single- 
arm forward spectrometer. With a pseudo-rapidity coverage of 2 < n < 5 about 
3596 of all produced bb pairs lie within the detector acceptance. The average boost 
of the b hadrons is large, By ~ 25, resulting into average flight distances of about 
1 cm. Although the bb cross section at the LHC is large, the total inelastic cross 
section is about a factor 200 larger. The huge rate of non b events together with the 
fact that the decay products of the b hadrons are comparably soft makes the trigger 
to the primary challenge for any B meson experiment. Typical combined trigger 
and reconstruction efficiencies vary between 0.1 and 10% depending on the decay 
channel. Channels with muons in the final state are in general easier to trigger and 
reconstruct. 

At a hadron collider the two produced b quarks hadronise separately and no 
quantum correlation between the two b hadrons exists. It is therefore harder to 
conclude from the flavor of the second b hadron (tagging B), which for the case 
that it is a neutral B meson oscillates independently, on the production flavor of the 
signal B. Effective tagging efficiencies therefore stay significantly below 10%. 

The LHCb forward spectrometer is shown in Fig. 9.26 and includes a high- 
precision tracking system consisting of a silicon-strip vertex detector (Velo) sur- 
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Fig. 9.26 LHCb detector consisting of a vertex detector (Vertex Locator), a first Cherenkov 
detector (RICH), a large-area silicon-strip detector (TT), the main tracking stations (T1, T2, T3) 
with an inner part built from silicon-strip detectors and outer part build with straw drift tubes, a 
second Cherenkov detector (RICH2), a scintillating-pad detector (SPD) and a preshower detector 
(PS) in front of an electromagnetic (ECAL) and hadronic calorimeter (HCAL, and the muon system 
with 5 stations (M1 to M5). A dipole magnet (4 Tm) is placed between the large-area silicon-strip 
detector and the main tracking stations [77] 


rounding the pp interaction region, a large-area silicon-strip detector (TT) located 
upstream of a dipole magnet with a bending power of about 4 Tm, and three stations 
of silicon-strip detectors (IT) and straw drift tubes (OT) placed downstream of the 
magnet. The tracking system provides a measurement of momentum of charged 
particles with a relative uncertainty that varies from 0.596 at low momentum to 1.096 
at 200 GeV/c. The minimum distance of a track to the primary proton-proton vertex, 
the impact parameter (IP), is measured with a resolution of 


orp = (15+ 29/pr)um [pr in GeV/c] 


The average boost and the vertex resolution translates into a decay-time resolution 
of about 50 fs. Different types of charged hadrons are distinguished using infor- 
mation from two ring-imaging Cherenkov detectors (RICH). Photons, electrons 
and hadrons are identified by a calorimeter system consisting of scintillating pads 
(SPD) and a preshower detector (PS), an electromagnetic calorimeter (ECAL) and 
a hadronic calorimeter (HCAL). Muons are identified by a system composed of 
alternating layers of iron and multiwire proportional chambers. The online event 
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selection is performed by a trigger system consisting of a hardware stage, based 
on information from the calorimeter and the muon systems, followed by a software 
stage, which applies full event reconstruction. 

The displacement of the B decay vertex from the primary interaction vertex 
is an important identification requirement of B events. However, due to the large 
boost in forward direction, the association of b hadrons to their production vertex is 
challenging, in particular if the number of primary vertices is large. In order to keep 
the average number of primary proton-proton interactions per bunch crossing at the 
most optimal value (below 2) the instantaneous luminosity for the LHCb interaction 
point was leveled by displacing the centres of the colliding beams slightly. Until 
2018 LHCb has recorded data corresponding to an integrated luminosity of 3 fb^! 
at 7 and 8 TeV, and 6 fb^! at 13 TeV proton-proton centre-of-mass energy. This 
data sample corresponds to more than 1 x 10!? bb events produced inside the LHCb 
acceptance. 


9.6.3 Measurements of B Oscillations 


Mixing in the B0 Bo system was discovered in 1987 by the ARGUS collabora- 
tion [80] and the first time-integrated determinations of the mixing parameter xg 
have been performed by the ARGUS and the CLEO collaborations [81, 82]. Time- 
dependent measurements of the mixing frequency Amg became possible at the 
electron-positron collider LEP. The silicon vertex detectors and the large boost of 
the B mesons produced at the Z resonance allowed to observe the B oscillations. 
With their high statistics and low background B samples the BABAR and Belle 
experiments improved the errors on the mixing frequency Am, significantly 
[83, 84]. Exploiting the excellent time resolution as well as the large statistics 
of recorded B mesons the LHCb experiment performed the so far most precise 
measurement of Amg [88]. 

After the first B? mixing measurement it was clear that mixing was an important 
effect also in the B? —B9 system and theory predicted a much faster oscillation of the 
B? meson. While limits on x; existed from LEP and the two Tevatron experiments it 
took until 2006 that the CDF collaboration resolved the fast mixing and performed 
the first measurement of the mixing frequency [85]. The measurement was repeated 
with much smaller uncertainties by the LHCb collaboration in 2012 [86]. 

The oscillation frequency of neutral B mesons is measured using flavor-specific 
final states, i.e. final states like B? — D(9-4£* y, or B? + Drt where the charge 
of one of the final-state particles (e.g. the lepton or pion charge) indicate the flavor 
of the decaying signal B® (e.g. b — ĉl" v). In case of incoherent production of the 
two b hadrons, a flavor-specific decay of the second B (Brag) is used to tag the flavor 
at the time t = 0 of the production. For the coherent production at the B factories the 
tagging B defines the flavor of the signal B at time At with respect to its decay—see 
Fig. 9.24 and Eq. (9.201). B-factory experiments have also used explicitly so called 
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dilepton events where both B mesons decay semi-leptonically. Flavor-mixing of one 
of the B mesons is indicated by the presence of same-sign lepton events. 

For LHCb, where the boost factor of the decaying B is not known and must 
be reconstructed from the final-state particles, semi-leptonic signal decays with an 
undetected neutrino present an additional complication compared to full hadronic 
decays. To account for the missing neutrinos in the calculation of the proper decay- 
time correction factors are used. 

An important performance number for the mixing measurement and even more 
for CP violation measurements is the tagging power or effective tagging efficiency 
Q= Etag Ding which corresponds to the efficiency to correctly tag the flavor of 
the signal B meson. The tagging power is the product of the tagging efficiency 
Etag, the probability that a specific algorithm delivers tagging information, and the 
square of the dilution factor Djag = 1 — 2m, where c is the mistag probability, 
ie. the probability that a tagging decision is wrong. At a hadron collider, B 
mesons are produced independently of each other and mistag probabilities are large 
(about 40%). The B-factories profit from the coherent BÜBÓ production and mistag 
probabilities of only a few percents are achieved for some tagging algorithms. 
Consequently, the tagging power at the B-factories is as high as 30% while for 
LHCb values only up to 6% are obtained. 

If one considers the effect of the tagging dilution Drag as well as the effect of a 
finite time resolution expressed by a dilution factor D; one expects that the measured 
flavor asymmetry Ameas parameterizes as, 


Nunmixed (t) — Nmixea (t) = cos Amt 


A H = A T -— DT 
meas (t) Nunmixed (©) + Nmixea (t) tag 71 Cosh AT /2t 


(9.202) 


where t is the decay time of the signal B. For the measurement at BABAR and Belle 
with coherently produced B mesons the time difference At = tsig — trag between 
the decay times of the signal and the tagging B needs to be used instead. Nunmixed 
and N,;,e4 are the observed numbers of unmixed (B°B®) or mixed (B°B® or B9 B9) 
events for different decay times. The effect of the finite time resolution is treated in 
the fit by a convolution with a resolution function. For LHCb, the effect of the time 


resolution on the measurement of the slow B?-BO mixing is negligible (D; ~ 1). 


For the fast oscillating BO-p9 the measured decay-time resolution of 44 fs leads to 
a dilution factor of ~ 0.73. 


9.6.3.1 Measurement of the B^-B? Oscillation Frequency 


BABAR and Belle have used large event samples of dilepton events where both B 
mesons decay leptonically with either a muon or an electron in the final-state to 
measure Amg . Figure 9.27 shows the measured decay time difference At (t — t1, 
see above) of opposite-sign and same-sign di-lepton events as measured by BABAR, 
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Fig. 9.27 Distributions of time difference A: of the two neutral B mesons for (a) opposite- 
sign and (b) same-sign dilepton events; (c) asymmetry between opposite- and same-sign dilepton 
events. Points are data and the lines correspond to result of a fit. Figure taken from [83] 


as well as the mixing asymmetry Ameas (^t). The fit to the data resulted in a value 
of Ama = (0.493 + 0.012 + 0.009) ps! [83] 

The most precise determination of Amg at the B-factories was performed by 
Belle analyzing simultaneously semi-leptonic decays B? — .D*- £v and a set of 
four different hadronic decays in a data sample of 140 fb! and resulting into the 
value Amg = (0.511 + 0.005 + 0.006) ps! [84]. The various Am; measurements 
at the B-factories have been averaged [87] to 


Ama = (0.509 + 0.003 + 0.003) ps! . 


LHCb has performed a mixing measurement using fully reconstructed hadronic final 
states as well as semi-leptonic decays. The most precise LHCb measurement was 
performed using semi-leptonic BÓ decays. A total of 1.923 x 10° B? — D7 Utva X 
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candidates and 0.829 x 10° B? — D” u*v X candidates (charge conjugated 
decays included) are used [88] and yield 


Am4 = (0.5050 + 0.0021 + 0.0010) ps7! . 


Dominated by the LHCb result, the current world average of the existing Ama 
measurements yields 


Ama = (0.5064 + 0.0019] ps^! [130]. 


Using the B lifetime average of 1.520 + 0.004 ps [130] one obtains the mixing 
parameter, 


Amq 


= 0.770 + 0.004 [130], 


Xd = 
d 


which is of similar magnitude as the value of the neutral K system, xg = 0.945 + 


0.002 (see Sect. 9.5). As predicted, the decay width difference of the B? turns out to 
be very small. The world average [130] of y, is, 


ATQ 
= — = —0.001 + 0.005 
ua 2ra 


compared to yg = —0.9965 for the K meson. 


9.6.3.2 Measurement of the B?—B? Oscillation Frequency 


Inserting the corresponding CKM elements in Eq. (9.187) one expects that the 
oscillation of the neutral B? meson is by a factor [Vis / |V;al? faster than the 
oscillation of the B?. To resolve the fast oscillation pattern represented a challenge 
and the first measurement of Am, was achieved only in 2006 by the CDF 
experiment [85]. 

The by far most precise determination of Am, was performed by LHCb [86] 
using about 34,000 B? — D; x* decays (charge conjugated decays are included). 
Figure 9.28 shows the decay time distribution dependent number of observed B? 
meson decays which decay with the same (unmixed) or the different flavor (mixed) 
with respect to their production. One can nicely observe the rather fast oscillation 
pattern. A fit accounting for the tagging dilution and for the finite time resolution 
results into 


Am; = (17.768 + 0.023 + 0.006) ps! [86]. 


This value agrees well with the theoretical prediction Am; = (18.3 + 2.5) ps^! 
which however, exhibits large errors due to the uncertainties of the hadronic 
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Fig. 9.28 Decay time 
distribution for B? candidates 
tagged as mixed (different 
flavour at decay and 
production; red, continuous 
line) or unmixed (same 
flavour at decay and 
production; blue, dotted line). 
Figure adopted from [86] 0 
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parameters fp, / Bp, [66]. With the mean B life-time, t = 1.509 + 0.004 ps“, 
one obtains for the mixing parameter 


x, = 26.72 + 0.09 [130]. 


9.6.4 CP Violation in Neutral B Meson Mixing 


CP violation in mixing, aj;;; 4 0, is predicted to be O(1074) for BÓ mesons and 
O (1075) for B? mesons [66]. As the CP violation in mixing is probed using semi- 
leptonic B decays it is often also referred to as ası (see Eq. (9.56)). In 2010, the DO 
collaboration reported an anomalous charge asymmetry in the inclusive production 
rates of like-sign dimuon events [89] indicating a significant deviation from the 
prediction. Their most recent study [90] shows a discrepancy to the theory prediction 
of about three standard deviations . 

B mesons containing a b quark undergo decays with a positively charged leptons 
in the final-state (b — c£*v;), while B mesons decay into negatively charged 
leptons (b — c ve). In case of pair production of B mesons with subsequent 
semi-leptonic decay, a pair of negatively charged leptons indicates the mixing of 
B? into a B, while the observation of a positively charged lepton pair signals the 
transformation of B? into B?. The dilepton asymmetry is measured with dimuon 
events and is defined as 


Ta —q7 
N(ut pt) + N(u-n-) 

where N(u™u*) and N(u- 2) are the number of events with two positively or 

two negatively charged muons, respectively. At a hadron collider the detection of 

two muons does not distinguish between initial B? or initial B? meson and the above 

dimuon asymmetry as determined by the DO experiment measures a combination of 

the B? and B? mixing asymmetries aĵ, and ag, defined in Eq. (9.189). 
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Measurements performed by BABAR, Belle, DO and LHCb determine separately 
the mixing asymmetries ad. and ag, for the two neutral B mesons by using partially 
reconstructed semi-leptonic decays or with fully reconstructed hadronic decays. 
Measuring the expected very tiny CP asymmetries requires a precise understanding 
of various experimental and instrumental effects, e.g. asymmetries arising from 
different detection efficiencies (Ap) or different material interaction (Az) for the 
two charge-conjugated final states. The so far most precise determination of both 
mixing asymmetries has been performed by the LHCb experiment. At the LHC 
the initial proton-proton collision is not a particle-antiparticle symmetric state. The 
number of produced b and b hadrons of a given species is not necessarily the same 
and the production asymmetry Ap has also to be taken into account when the 
measurement of the mixing asymmetry is performed. The measured asymmetry of 
a neutral B meson, independent whether it is a B? or a B®, into a final state f or its 
conjugated state f can be expressed by the mixing asymmetry ders 


N(B0/B9 > f) — N?B? > fy 
N(BD/B0 — f)(t) + N(BO/Bo > fy) 


meas — 


q 
er dir cos (Amst) 
= = eyelet Spic o M ; 9.204 
5 (^ + > ) edi (aT ,t/2) T Apt Ar ( ) 


By omitting the flavor tagging of the decaying neutral B the sensitivity to ası is 
a factor 1/2 smaller with respect to a tagged measurement. At the same time one 
dramatically wins in statistical power as no tagging is required. 

For the B? system the time dependent analysis together with the precise 
determination of Ap and A; allows to eliminate the production asymmetry Ap. 
LHCb has performed this complicated analysis [91] and obtained for the mixing 
asymmetry of the B? system the value 


ad, = (—0.02 + 0.19 + 0.30) % 


with a total precision of 3.6 per mill. In case of the fast oscillating B? system the 
measurement simplifies as the term involving the production asymmetry cancels in a 
time integrated measurement. Furthermore, the measurement was performed using 
D. wt v, X final states with the D; meson decaying to ¢ (~> K* K )x- [92]. For 
the charge symmetric K * K ^ state the interaction asymmetry is negligible and one 
only needs to correct for the pion detection asymmetry. The final result obtained for 
ds is 


ai, = (0.39 + 0.26 + 0.20) % [92], 


with a total precision of 3 per mill. 
Figure 9.29 shows the different asų measurements [91—97] for the two neutral 
B systems as well as the averages. The DO measurement indicating large mixing 
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Fig.9.29 Mixing asymmetries ag, and ad, as measured by the different experiments [91—97]. 


The DO dimuon measurement (contour) [90] determines a combination of ag, and ad. Green 
bands show the averages. The theory prediction is marked by the black dot. Figure adopted from 


[92] 


asymmetries is not confirmed by the other experiments. The measured as, values 
can be used to determine the deviation of |q / p| from unity, 


1 - |a/pl* 
pe———— — 1"20-lh/pb. (9.205) 
1+ |q/pl 
or equivalently 
PLE (9.206) 
p 2 


The most recent world averages as provided by the HFLAV group^ and published 
in [130] are for the B? system, 


ad, = 0.0021 + 0.0017, 
lg/p| = 1.0010 + 0.0008 , 


and correspondingly for the B? system, 


az, = 0.0006 + 0.0028, 
la/p| = 1.0003 + 0.0014. 


4Heavy Flavor Averaging Group 
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The values so far show no evidence for CP violation in mixing and are consistent 
with the theory prediction of very small C asymmetries [66]. 


9.6.5 CP Violation in the Interference of Mixing and Decay 


The B-factories BABAR and Belle were built with the primary goal to discover CP 
violation in the interference of B decays to the CP eigenstate B? — J/YKs with 
and without mixing. Large CP violation had been predicted for this channel. Shortly 
after the start of data-taking the two collaborations presented first measurements in 
summer 2000. A clear evidence of the first CP violation outside the neutral K meson 
was established in 2001 [59, 60]. 


9.6.5.1 B" Meson 


The expected time-dependent CP asymmetry is defined in Eq.(9.192) and 
parametrised by Eq. (9.194). Taking into account the negligible width difference, 
AT ~ 0, for B? mesons the time-dependent CP asymmtery for the channel 
B? — J/y Ks can be written as 


Ay ks (t) = AT] x, sin(Amat) — AG, g, cos(Amqt) , (9.207) 


where the coefficients ATA Ks and AT Ks have been introduced in Eq. (9.195). The 


decay B? — J/V Ks is a golden mode, dominated by a single tree-level amplitude. 
CP violation in the decay amplitudes can therefore be neglected, i.e. AG, Ks ~ 0. 
The coefficient AT x, 18 given by the CKM matrix elements involved in the short 
range box diagrams responsible for the B? mixing and by the CKM matrix elements 
appearing in the decay amplitudes. With A7} g, = —SAs/wKs (see Eq. (9.195) 


.. J/WKs 
and the definition of Ay/yKs of Eq. (9.193), 


q ÅJ yks 


AJ Kg = ; 
PES p ATE 


one can calculate the time-dependent CP asymmetry according to Eq. (9.197). For 
the decay B? — J/y Ks one finds for the ratio g/p describing the mixing and for 
the ratio of the amplitudes the following CKM factors,? 


AJIVKs Ve VŽ \ [Ves VŽ 
= " and SINKS = NI /WKs ( is i) ( : a) , (9.208) 
P Vib Vig AJ Ks Vp Ves Ves Ves 


q _ Vib Vta 


5True only up to a phase factor which cancels later on. 
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The second CKM term in the expression for the amplitude ratio is a result of the K°- 


Ko mixing necessary to produce the Ks. As the Ves terms all cancel one obtains for 


mix 
the AT A Ks 


. ve Vid Vep V* 
AT Ro = SANIT WK (4) ( a] = —ny/wKs sin(28), — (9.209) 
J/vy Ks | /V Ks Vib V5, Vi Ved /WKs 


where the last equality uses the definition of the CKM angle £ of Sect. 9.7. Finally 
one obtains for the time-dependent CP asymmetry, 


A jjyxo(t) = —n3/y xs sin (2B) sin (Amat) . (9.210) 


For the decay B? — J/ Ks the CP eigenvalue of the final state is NI /wKs = —1. 
Due to opposite CP eigenvalue of the Ky the CP eigenvalue of the decay B? — 
J/WKi is nj;jyK, = +1. Both statements are strictly true only if one ignores the 


very small CP violation in the K oK? mixing. The decay B? — J/yKy is interesting 
as one expects exactly the same magnitude of the CP violation as for B? — J/YKs 
but an opposite time behavior due to the opposite C7? eigenvalue. 

Experimentally the decay B? — J/ YK? is easy to access. The J/ọy meson 
decays into two leptons and can be easily triggered and selected. Additional 
requirements on the displacement of the B vertex from the primary vertex allow an 
almost background-free reconstruction even at a hadron collider. The measurement 
has been extended to include other (cc) resonances for which the same CP behavior 
as for the golden mode is expected. 

At the B-factories the time-dependent CP asymmetry is measured as function of 
the time difference At = f? — tı. The most precise determination of the quantity 
sin (28) has been performed by the Belle experiment using about 32,000 signal 
candidates with an average signal purity of 79% [98], 


sin (28) = 0.667 + 0.023 + 0.012. 


The time-dependent CP asymmetry for the CP-odd and CP-even final states as 
measured by Belle are shown in Fig. 9.30. The BABAR experiment has recorded 
less signal candidates (15,481 candidates with a purity of 7696 ) and obtained [99], 


sin (28) = 0.687 + 0.028 + 0.012, 


with a slightly larger statistical error than the Belle measurement. 

The LHCb collaboration has also performed the measurement of sin (28) by 
analysing the time dependent CP violation in B? — J/yKs and B? — w(2S)Ks 
decays with 52,000 and 8000 signal candidates, respectively [100, 101]. As for the 
measurement of the CP violation in mixing the observed time dependent asymmetry 
has to be corrected for the production asymmetry between B? and B? of about 1%. 
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Fig. 9.30 Flavor-tagged At distributions and raw CP asymmetries for the Belle measurements 
of sin 2f. The top plots show the B? > (cc)Ks (n f = —1) measurement and the bottom plots 
show the measurement for B? — J/y Kj, (n f = +1) The distributions are background subtracted. 
Figure adapted from [98] 


The resulting value sin (26) = 0.760 + 0.034 [101] is slightly larger than the ones 
measured at the B-factories. It is important to notice that despite the much larger 
number of signal candidates used for the LHCb measurement the total error is 
comparable with the ones of Belle and BABAR. The reason is the significantly 
larger mistag probability at a hadron collider which leads to a reduction of the 
statistical power of the events. Using all available measurements of sin (26) an 
average value of 


sin (2B) = 0.691 + 0.017 
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is obtained [87] which corresponds to a value of the phase £ of, 
B = (21.9+0.7)° or 8 = (68.140.7)°, 


where the ambiguity is not resolved. 


9.6.5.2 B? Meson 


Analogue to the measurement of the CKM phase f in the B? system, the mea- 
surement of a corresponding CP violating phase 8s, defined in Sect. 9.7, can be 
performed in the B? system, assuming that the B? oscillation can be resolved. The 
golden channel used is the decay B? — J/wd. Due to its higher mass the B? system 
cannot be produced at the Y (4S) resonance and fs was therefore not accessible at 
the B-factories. 

The two Tevatron experiments CDF and DO have performed explorative studies 
of the decay B? — J/v4 [102, 103] and also first measurements [104, 105] but 
the results have been limited by the number of recorded signal events. The first 
significant measurement of the phase? øs = —25, has been presented by the LHCb 
experiment in 2011 [106]. 

In the decay B? — J/w¢ the final-state is composed of two vector particles 
and angular momentum conservation allows for the relative angular momentum 
values L = 0,1 or 2. The final-state J/V$ is thus not a pure CP-state but a 
linear combination of CP-even and CP-odd eigenstates depending on the relative 
angular momentum L of the two vector mesons: ncp = ncp(J/v)ncp($)(— pz 
(—1)/. The measurement of the time dependent CP asymmetry requires a separate 
treatment of the CP-odd and CP-even states which can be achieved statistically by 
analyzing the angular distribution of the final-state particles J/V — jt and 
$ — K*K~. To measure the angular distribution of the final state particles it is 
common to use the helicity basis. The three decay angles necessary to describe the 
decay are denoted by (9x, 0,,, gn) and are defined in Fig. 9.31. The polar angle 0x 
(0,) is the angle between the K + (u™) momentum and the direction opposite to the 
B? momentum in the K+ K^ (+ u`) centre-of-mass system. The azimuthal angle 
between the K ^ K~ and u* u” planes is gp. 

The analysis of the B? — J/wd events is further complicated by a small 
fraction of non-resonant B? — J/VK*K- decays (CP-odd) which interfere with 
the @ meson decaying to two charged kaons. These non-resonant events need to be 
considered. Effectively one therefore studies B? decays to J/V K* KT. 


6In the Standard Model the phase $, determined by the measurement of the time-dependent CP 
asymmetry in the decay B? — J/w@ equals in leading order to —28, with f, being defined in 
Sect. 9.7. However, in case of new physics effects the two phases could differ. In the context of this 
textbook these differences are ignored. 
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K* 


Fig. 9.31 Definition of helicity angles as discussed in the text. Figure taken from [107] 


In the absence of CP violation in mixing the CP-odd and CP-even decays 
of the B? meson correspond to the heavy and light mass eigenstates. Thus 
separating statistically the polarization amplitudes in an angular analysis allows 
the measurement of the decay width Ty and I’; and thus the measurement of the 
different lifetimes of the two BÜ mass states. 

With a data-sample of 96,000 B? — J/wK*+K~ events LHCb has performed 
a simultaneous analysis of the measured decay time and the three decay angles to 
determine the time-dependent CP asymmetry and the related CP phase $, [108]. 
Figure 9.32 shows the decay time distribution as well as the distributions of the three 
decay angles. One clearly sees how the CP-odd and CP-even components can be 
statistically separated using the decay angles. The decay time distribution shows the 
different decay behavior of the two CP components with the CP-even component 
decaying visibly faster. For the average decay width I’; and for the decay width 
difference AI’, one obtains [108], 


D; = (Tz, + Ty)/2 = 0.6603 + 0.0027 + 0.0015 ps! , 


AT, =T; — Ty = 0.0805 + 0.0091 + 0.0032 ps^! . 


The simultaneous determination of the time-dependent CP asymmetry results in 
the following value for the CP violating phase ġs, 


ods = —0.058 + 0.049 + 0.006. 


The time-dependent CP violation in the B? system is thus very small and within 
the sensitivity of the measurement no C7 violation has been seen. The phase $; can 
also be measured in the mode B? — J/watn-. The J/watz~ final-state has been 
shown to be an almost pure (97.7%) CP-odd final state [109] and the CP violating 
phase can thus be extracted without the complication of an angular analysis. A much 
smaller signal sample with 27,000 events results into a comparable precision for ġs, 
ods = —0.070 + 0.068 + 0.008 [110]. The analysis of these events however, requires 
the knowledge AT, as external input. Performing a combined analysis of the two 
decay channels [108] LHCb reports a value of $; of 


$s = —0.010 + 0.039. 


600 K. Kleinknecht and U. Uwer 


Candidates / (0.2 ps) 
Candidates / 0.05 


soob Le Na d 


5 ok mem 
10 5 10 -1 -0.5 0 0.5 1 
Decay time [ps] cos Ox 
3500 SSS 3500 c T T T 
3000 = LHCb 4 Ẹ3000 LHCb 
= 


Candidates / 0.05 


cos 9, 9, [rad] 


Fig. 9.32 Decay-time and helicity-angle distributions for B} > J/y K* K- decays (data points). 
The solid blue line shows the total signal contribution, which is composed of CP-even (long-dashed 
red), CP-odd (short-dashed green) and S-wave (dotted-dashed purple) contributions. Figure taken 
from [108] 


The ATLAS and CMS collaborations have also performed measurements of the 
phase $, and of the width difference AT, [111, 112]. The overall experimental 
situation is summarized in Fig. 9.33 where also the theoretical expectation is shown. 
It should be noted that there is no ab-initio theory prediction for the phase ¢,. The 
prediction results from the measurement of sin (28) and exploits the unitarity of the 
CKM matrix, see also Sect. 9.7. The average value for $,, $, = —0.021 + 0.031 
[130], agrees very well with the expectation øs = —0.0370 + 0.0006 [113]. 


9.6.6 Direct CP Violation 


B meson decays offer a broad decay phenomenology and are an ideal system to 
look also for large direct C7? violation. Topologically very different tree-level and 
penguin decay amplitudes can result into the same final state and their interference 
can cause significant CP asymmetries. Considering the B meson decay B — f 
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Fig. 9.33 68% confidence level contours in the $,-AT', plane, showing the measurements from 
CDF, DO, ATLAS, CMS and LHCb together with their combination (white ellipse). The thin black 
line represents the Standard Model predictions. Figure taken from [130] 


and its CP conjugates B — f, direct CP violation or CP violation in the decay 
is measured as time-integrated asymmetry of the observed signal yields: 


| T8382 f)- T(B f) 


A = L—————É——— 
CU TG prrG 


For a given B decay B — f, direct CP violation can only arise if at least two 
interfering decay amplitudes A; and A» exist and if these decay amplitudes carry 
different weak and strong phases, o; and 5;. The total decay amplitude A y is given 
as sum 


Ar — |Ail eniti 4 [A2] eot 


For the amplitudes of the charge conjugated process A f the weak phases change 
sign while the strong phases stay unchanged, 
A; = |A|| ee tor y [A2] e 2482 
The expected signal yields depend on the interference of the amplitudes and the 
observable CP asymmetry is determined by the difference of the interference terms. 
The CP asymmetry is a function of the weak and strong phases as well as a function 
of the ratio R of the decay amplitudes, 


2sin($1 — $2) sin(d; — 92) 


Acp = > A Pol ui ean UR, S 
R + R^! + cos($1 — $2) cos(ô1 — 42) 


. A1 
with R= 2]. (9.211) 
Az 
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To observe large asymmetries the interfering amplitudes should have different 
weak and strong phases. While the weak phases, and in particular the weak phase 
difference, are determined by the CKM parameters, the theoretical determination 
of the strong phases is difficult. The B meson are subject to both, short and long- 
distance QCD effects which cannot be treated perturbatively. 

In the following, two classes of B decays exhibiting large direct CP violation 
will be discussed: B decays to charmless final-states, i.e. to final-states without D 
mesons, and decays to charmed final-states of the type B > DK. 


9.6.6.1 Direct CP Violation in Charmless B Decays 


As sketched in Fig. 9.34 charmless two-body B decays such as B? — K^ can 
proceed via two topologically very different classes of decay amplitudes, tree and 
penguin processes, which carry different weak and in generally also different strong 
phases. Large direct CP violation is therefore expected for these decays. 

First experimental evidence for direct CP violation in B meson decays was 
reported by the Belle collaboration in 2004 for the decay B > mtm [114]. 
As the neutral B meson decays here to a CP eigenstate the formalism of (9.194) 
and (9.207) has to be applied, i.e. the observable time dependent asymmetry can be 
parametrized by the coefficient AUT describing the effect of direct CP asymmetry 
and the coefficient A”!* to describe the effect of indirect CP violation through 
interference of diagrams with and without mixing. Belle observed a deviation of the 
coefficient Adir from zero, i.e. direct CP violation, at the level of 3.20. The result 
suggested large interference effects between the relevant tree and penguin diagrams 
and was confirmed by subsequent measurements. Today, the most recent value of 
this coefficient is AŽ? = 0.32 + 0.04 [87] and confirms the early hints for direct 
CP violation in BÓ > zz. 

The two B-factory experiments, BABAR and Belle, also studied B? > K*z- 
decays where large penguin contributions were expected. Both collaborations 
performed time-independent analyses and reported the observation of large direct 
CP violations for B? — K *z- decays ( Babar: Acp = —0.133 + 0.030 + 0.009 
[115]; Belle: Acp = —0.101 + 0.025 + 0.005 [116]). Both measurements represent 
the first doubtless observation of large direct CP violation in the BÓ system. It is 


= ^ p? 
b —e— ea à uet u 
p? 7 La 
—— — n 
d ed d d 


Fig. 9.34 Tree-level (left) and penguin (right) contribution to the charmless two-body decay 
Bo > K*z- 
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Fig. 9.35 Invariant mass spectra of (a, b) B? — K*n7 the recorded decays and zoomed in (c, d) 
to show the B? — K-* decays. Panels (a) and (c) represent the K * z^ invariant mass, whereas 
panels (b) and (d) represent the K~z7* invariant mass. The results of a mass fit describing the 
signals are overlaid. The main components contributing to the fit model are also shown. Figure 
taken from [118] 


useful to remember that direct CP violation in the neutral kaon system is many 
orders smaller (O( 1075). It is the effect of the large weak phases entering the p? 
decay amplitudes which produces the much larger direct CP violation. 

Already with the very first data-set recorded in 2011 and corresponding to only 
0.35 fb7! of data, the LHCb experiment has repeated the measurement of Ac p for 
the channel B? > K*z- [117]. LHCb's excellent resolution of the reconstructed 
K*zx* invariant mass allows to setneutsh between a decaying B? and B? and thus 
also probed the decay B? — K~x*. The measurement of Acp (BO > Krt) = 
0.27 + 0.08 + 0.02 provided the first evidence (3.30 ) for direct CP violation in 
the B? system. The measurement was repeated with more data and the observed 
direct CP violation Acp(B}? — K~x*) = 0.27 X 0.04 + 0.01 [118] confirmed 
the earlier result. Figure 9.35 shows the invariant mass distribution for the recorded 
Be jo Kn decays. Clear differences in the number of recorded decays for the B 
and the anti-B decays are observed. 


604 K. Kleinknecht and U. Uwer 


s " 
K- B- D? > [f]p 


= tt E = t = 
u u u u 


Fig. 9.36 Tree-level Feynman diagrams of the decays B^ > D°(D°)K ^. The left diagram 
implies a b — ucs transition, and is strongly suppressed by the small value of | V,,,|. The right 
diagram proceeds via a transition b — cus 


9.6.6.2 Direct CP Violation in B — DK Decays and Measurement of 
CKM Phase y 


While direct CP violation in B decays is large and experimentally easy accessible, 
the theoretical interpretation is non-trivial. The calculation of the non-perturbative 
hadronic effects and the resulting strong phases is difficult. A way to use direct 
CP asymmetries to constrain the CKM parameters is therefore a simultaneous 
determination of the weak (CKM) phases and the hadronic nuisance parameters 
including the strong phases. This approach is followed to determine the CKM phase 
y as defined in Sect. 9.7. Ignoring higher order terms, y is in good approximation 
equal to the phase of the CKM matrix element Vyp (Vin = | Viol eV), 


The phase can be probed using BT — D'(DK- decays with either a b — ucs 


(DK =) or b > cits (D? K-) tree-level quark transition as depicted in Fig. 9.36. 
The two decays can be described by three parameters: rg (~ O (0.1)), the absolute 
value of the ratio of both amplitudes; ôg the strong phase difference; and the 
CKM phase y. In order to observe the CP violating interference between the 


two amplitudes the D? and the D' mesons emerging in case of the two different 
amplitudes should decay into a common final state fp, i.e. D, D > fp. The size 
of the CP violating interference which provides the sensitivity to y is proportional 
to rp. 

The possibility of observing direct CP violation in B^ — DK- decays was 
first discussed in the 1980s [119, 120] using decays of the D(D) to neutral kaons 
and pions. Since then, several methods have been proposed which can be grouped 
according to the choice of the final state. 


* The Gronau-London-Wyler (GLW) method [121, 122] considers the decays of 
D mesons to CP eigenstates, such as the CP-even decays D? — K*K- and 
D? > at a. 

* The Atwood-Dunietz-Soni (ADS) approach [123, 124] extends this to include 
final states that are not CP eigenstates, for example D? — K*+z~ together 
with its doubly Cabibbo-suppressed counterpart D? — K 7^. The interference 
between Cabibbo-allowed and doubly Cabibbo-suppressed decay modes in both 
the B and D decays gives rise to large charge asymmetries. However, the different 
D decays require additional parameters rp and dp to describe the ratio of 
suppressed and favored D decay amplitudes as well as their phase difference. 
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* The Grossman-Ligeti-Soffer (GLS) method [125] is similar to the ADS method 
but uses singly Cabibbo-suppressed decays such as D — KsKt 7 decays. 

* The Giri-Grossman-Soffer-Zupan (GGSZ) method [126] uses self-conjugate 
multibody D meson decay modes like Ksz^z ^ or KsKtK™ and requires an 
analysis of the Dalitz plot to account for the varying D decay parameters. A 
model-dependent analysis assumes specific D decay amplitudes while a model- 
independent approach uses external input for the strong-phase difference 6p and 
the D amplitude ratio rp in bins of the Dalitz space. 


Simultaneous fits to several observables, CP asymmetries or ratios of suppressed 
to favored modes, allow the determination of the decay parameters including the 
CKM phase y. The main issue with all methods is the small overall branching 
fraction of the observable decays which range from 5 x 107? to 5 x 107? [127]. 
The precise determination of the CKM phase y therefore requires a very large data- 
sample. The B factories have pioneered different methods to determine y, however 
the achieved overall statistical precision on y was limited. Combining the different 
methods and quoting a single result for y, BABAR and Belle report the following 
values [127]: 


y = (69+ 17)? (BABAR), 
y = (68+ 14)° (Belle). 


The breakthrough towards a precision determination of the phase y came 
when the LHCb experiment was able to measure even very rare doubly Cabibbo- 
suppressed decays such as BT — [xK +] pK and its charge-conjugated 
counter-parts with sufficiently high statistics to observe CP asymmetries. Fig- 
ure 9.37 shows as an example the decay BT — [n X ^]; K^ as well as the 
decay BT > [x K^ ] p7 which also provides sensitivity to y together with the 
corresponding charge conjugated decays [128]. The observed yields clearly signal 
direct CP violation for both channels. LHCb has analysed Bt — DK7* decays 
with a multitude of different decay modes of the neutral D, Bt — D*K* and 
B+ — DK** decays and B? — DK* decays. In addition time-dependent analyses 
of B? — DTK* and B? — D* x are performed and measure combinations of 
the CKM phases y and f; or P, respectively. The combination of the different LHCb 
measurements [129] finally result into a value of 


y = (74.0739)° (LHCb), (9.212) 


in perfect agreement with the early measurements by BABAR and Belle. For the 
average of all y measurements one finds [87], 


y = (73.5140. (9.213) 
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Fig. 9.37 Invariant mass distributions of selected B^ — [x K F] př% decays, separated by 
charge. The dashed pink line left of the signal peak shows partially reconstructed B? — 
[K ta] pK —z* decays, where the bachelor pion is missed. For the two channels, one clearly 


observes a yield difference, i.e. CP asymmetry, between the two charge-conjugated channels. 
Figure taken from [128] 


9.7 Weak Quark Mixing and the CKM Matrix 


The observation of direct CP violation in the neutral K meson and the confirmation 
of large CP violating effects in the neutral B meson system provided the experi- 
mental evidence of weak quark-mixing as the primary source of CP violation in the 
hadron sector. A multitude of precision measurements confirmed the prediction of 
the quark-mixing paradigm of the Standard Model [57] and led to the Nobel prize 
for Kobayshi and Maskawa. Despite this success of experimental and theoretical 
quark flavor physics we know today that other sources of CP violation must exists 
to explain the baryon asymmetry of our universe. 


9.7.1 Quark-Mixing Matrix 


Historically quark-mixing was introduced by Cabibbo [56] to explain the different 
coupling strength of hadronic currents in weak decays of neutrons and pions 
compared to strangeness changing processes, such as the decay of K mesons 
and A hyperons. Cabibbo postulated—expressed in today's notation—that the 
weak eigenstates of the then known quarks with charge —1/3 were not the flavor 
eigenstates d and s but a linear combination, rotated by an angle 0, the Cabibbo 
angle: 


d. = d cos 0 + s sin 0 . (9.214) 
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The GIM mechanism [131], introduced to cancel the K — 4*4 amplitude, 
required the orthogonal state 


Sc = —d sin 0 +s cos 0, (9.215) 


and an additional charge +2/3 quark, the charm quark, complementing the two 
quark doublets. 

We know today that three generations of up-type quarks u’ (i = 1, 2, 3: u-, c-, 
t-quark) with electrical charge +2/3 and three generations of down-type quarks 
d! (i = 1,2,3: d-, s-, b-quark) with electrical charge —1/3 exist. In the Standard 
Model, the masses and mixing of quarks arise from the Yukawa interaction with 
the Higgs condensate which couples the left-handed quark fields uz and dz to the 
right-handed quark fields ug and dg (for better readability the generation index is 
suppressed). After spontaneous breaking of the electroweak symmetry the Yukawa 
terms give rise to masses and mixing: 

v 


LYukawa ^ — Va (di Yadn + iij Yuug) 4 h.c. 


The Yukawa matrices Yq and Y, are complex 3 x 3 matrices in generation or 
flavor space and do not need to be diagonal. Indeed, in the Standard Model they are 
not, and as a consequence the flavor states are not equal to the mass eigenstates of the 
quarks. The mass eigenstates i17, & and d L,R are obtained by unitary transformations 
of the above quark flavor states: 44 = VA uuA and d A = Va.ada (with chirality 
index A = L,R for left and right-handed quark fields, and suppressed generation 
indices). The unitary matrices V4 y and Va a diagonalize the Yukawa matrices and 
one obtains the diagonal quark mass matrices for the up- and down-type quarks, 


f v 
Mau = diag(m,, mc, m;) = V LuYuVk u’ 


E 
J/2 


The quark masses will appear as usual Dirac mass terms in the above Yukawa part 
of the Lagrangian: 


Ma = diag(mg, ms, mp) = VL aYa Vh a: 


LYukawa = —d Madr — iij Malin + h.c. 


If the up-type and and down-type Yuakawa matrices Yy and Yq cannot be 
diagonalized simultaneously by the same transformations there is a net effect of 
the change of the quark basis. The charged current terms of the Standard Model 
Lagrangian combining left-handed up and down-type quarks therefore get a flavor 
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structure imprinted which is described by the Cabibbo Kobayshi Maskawa (CKM) 
quark-mixing matrix 


Verm = ViVi a - 


The charge current terms expressed in the mass eigenstates have the form 


Lee = E", (iiy WI Vernd, + dry“ Wg Vckw'ui) ; 


Here, the matrix element (Vcgm);j connects a left-handed up-type quark of the 
ith generation to a left-handed down-type quark of the jth generation. The matrix 
elements are therefore expressed using flavor indices: 


Vud Vus Vub 
VckM = | Va Ves Veb 
Via Vis Vib 


For a non-diagonal CKM matrix the charged quark currents are thus inter-generation 
flavor changing currents. In analogy to the flavor eigenstates introduced by Cabibbo 
it is usual to absorb the CKM matrix by introducing for the down-type quarks the 
weak quark eigenstates d' = VekmdL. 

Since it is the product of unitary matrices the CKM matrix itself is unitary, 
i.e. VcxmVckm’ = 1, and its elements are in general complex. The number of 
parameters of a general unitary 3 x 3 matrix is nine, three rotation angles and six 
phases. By rephasing the quark mass eigenstates d — e!®7g one can remove five 
phases, corresponding to the five independent phase differences between the quarks, 
and leaving one CP violating phase ô. The usual parametrization of the CKM matrix 
uses the three rotation angles 015, 023, 013 and the phase ô: 


10 0 ce O sae €, Sy O 
VckM = |0 c, 553 0 1 O0 Si Cy, 0 (9.216) 
0 —54 Cr uu T 0 01 
Ci5€45 S15€15 TN 


iô iô 
75,5€533 — €15$8538,5€ C153 — S12823S13  S23C13 ? 


id 16 
8152553 — €y9 935136 —0C,5853 — $5505 $,€  C23C13 


where Sj = sin 6;; and Cj; = COS ij. in the Standard Model, the phase ô is 
responsible for all CP violating phenomena in quark-flavor changing processes. 
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The rotation angles are defined and labeled in a way which relates to the mixing 
of two specific generations. In the limit 623 = 60;3 = O the third generation 
decouples, and the situation reduces to the usual Cabibbo mixing of the first two 
generations, with 01» identified as the Cabibbo angle. The angles 012, 023, 013 can 
all be chosen to lie in the first quadrant, i.e. s;,, c;; > 0, by appropriate redefinition 
of the quark field phases. 

From measurements it is known that 1 >> sp > 54, >> s. It is therefore 
common to use a parametrization of the CKM matrix that emphasizes this hierarchy. 
In the Wolfenstein parametrization one defines 


MO 
y |Vual? + Vest" 


where A is the sine of the Cabibbo angle (sin@ ~ 0.22) and the real numbers A, p 
and 7 are of order unity. With these parameters the CKM matrix can be expressed 
in powers of A and takes the convenient form 


=s = 


3 : id 
, AM (pc in) = se’ = Vip, 


1352 A —AX(p-ig) 
V= zx (am Ar? dS. (9.217) 
AP pin) —AA 1 


As the definition of (p 4- in) depends on the phase convention, one often introduces 
the parameters (p + i5), defined by 


AX (p + ig) /1— A244 


AA iti) = —— IL ms (9.218) 
V1 — 1 [1 — A244 (p + in)| 
This definition ensures that (o + in) = —(Vud V5) Ved V) is phase convention 


independent and the CKM matrix written in the parameters A, A, p and 7 is unitary 
to all orders in A." 

The elements of the CKM matrix are fundamental parameters of the Standard 
Model and need to be experimentally determined. The unitarity condition of the 
CKM matrix imposes a set of relations between the matrix elements: 


XO Vij VÄ = ôjx with jk = (dd, ds, db, ss, sb, bb} (9.219) 
i 


> Vij Vi, = Six with ik = (uu, uc, ut, cc, ct, tt} (9.220) 
j 


7To O (42) one finds p = p(1— å? /2) . 
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Fig. 9.38 Sketch of the (p.n) 
Unitarity Triangle (UT). 
Figure taken from [133] 


Va V, 


ud "ub 


Va Vab | 


td "ib 


| V, V; 
Va Vo 


(0,0) (1,0) 


The six vanishing combinations (j Z k, i # k) describe triangles in the complex 
plane. The area of all triangles is given by half of the Jarlskog invariant J [132], 


3 
s[vuvavivis] =F Y euncjn . (9.221) 


m,n—l 


where one representation of Eq. (9.221) reads for instance J = 3 [Vu s Veb Aan v 
Expressed in the parameters of the standard CKM representation one finds J = 
—0C,5€,4€5,5,55,,5,, Sind. A nonvanishing CKM phase and hence CP violation 
requires J Z 0. 

While four of the six unitarity triangles are degenerated and rather slim, only two 
triangles have approximately equal sides, of which one is usually referred as the 
Unitarity Triangle (UT), 


Vua Vip + Vea Ven + Via Vp = 0. (9.222) 


Commonly one normalizes the triangle basis to unity by dividing each side by 
Ved V3, to obtain a triangle with vertices exactly at (0, 0), (0, 1) and the apex at 
(0, n). The UT is sketched in Fig. 9.38. As can be seen from Fig. 9.38 the angles of 
the UT are 


Via VŠ 
o = arg (4) (9.223) 
Vud Vip 
Ved 2) 
= arg | — (9.224) 
B e( Va V5 
Vud 23 
y = arg | -——_ (9.225) 
i ( Ved Va 
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Often a different naming convention, also shown in Fig. 9.38, is used to label the 
three angles. The UT angles are constraint by the CP observables discussed in 
Sect. 9.6: 
B = 21.9: 0.7)? (9.226) 
= +4.2\0 
y = (03.5751) (9.227) 


The measurement of the angle o has not been discussed in Sect. 9.6. It is measured 
using the observation of time-dependent CP violation in the decays B? > xz, 
pp, and zp by BABAR and Belle. The average of this measurements result into 
[133, 134], 


o = (84.5729), (9.228) 


The sum of three angles, a + B + y = (180+7)°, is consistent with the 
expectation and represents a first test of the unitarity of the CKM matrix. An 
additional constraint of the CKM phases and thus of the UT angles comes from the 
measurement of time-dependent CP violation in B? decays and the measurement of 
the phase £, (with $, = —2ßs, see Sect. 9.6), 


Vis Vib o 
B; = arg | — ? | = (0.60 + 0.89)°. (9.229) 
Ves cb 


In addition to the measurements of CP violation in the B meson systems also the 
measurements of CP violation in the K? K? mixing, |ex| = (2.233 +0.015) x 10? 
(see Sect. 9.5) provides information about the CKM matrix. The measurement of 
|€x | can be translated into an approximate hyperbolic constraint on the apex (p, n) 
of the UT [135]. 

The sides of the unitarity triangle are accessible by measuring decay rates and 
mixing frequencies. In the following the experimental determination of the CKM 
elements as well as a test of the unitarity of the CKM matrix is discussed. 


9.7.2 Determination of the CKM Matrix Elements 


1. |V44|. Precise determinations of |V,4| are available from nuclear beta decays, 
from the decay of the free neutron and from semileptonic pion decays (Gr * — 
79 e* v). The most precise value results from an analysis of superallowed 0* — 
OF nuclear beta decays which are pure vector transitions. The measurements of 
the transition energies, the partial branching fractions, and the half-lives of the 
parent nuclei together with radiative and isospin-symmetry-breaking corrections 
allow the determination of the corrected Ft-value, from which, by using the 
muon life-time, |V,4| can be determined. The average of the fourteen most 
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precise determinations yield [136], 
|Vua| = 0.97417 + 0.00021 , 


where the error is dominated by the theoretical uncertainty stemming from the 
nuclear structure and radiation correction. 

The theoretical uncertainties in extracting a value of |V,q| from neutron decays 
are significantly smaller than those for the superallowed decays, however the 
value depends on the ratio between axial-vector and vector couplings (g4 = 
GA/ Gv) and on the neutron lifetime. Using the most recent measurements [133] 
the following value for |V,4| is obtained, 


|Vua| = 0.9763 + 0.0016, 


with the error dominated by the g4 uncertainty. 

An alternative approach is the measurement of the very small (O(10-5)) 
branching ratio of the pion beta-decay 7+ — zz e*v. The value normalized 
by a very precise theoretical prediction for 7* — e* v and yields [137] 


|Vua| = 0.9749 + 0.0026 . 


The error here stems mainly from the measurement of the rare process. 

2. |Vys|. Earlier measurements of |V„s| from kaon decays have used Kj, — mev 
to extract the product of |V„s| and the form factor |V,,| f+(0) at q? = 0. The 
most recent data provide enough experimental constraints to also use decays to 
muons as well as decays of Ks and K~. Averaging results from KL, — xev, 
KL > mrw, K^ > zx Üetvy,K* > nÜutv and Ks — xev yields the value 
[Vus] f. (0) = 0.2165 + 0.0004 [133]. Lattice QCD calculations of f (0) have 
been carried out for different numbers of quark flavors. The form-factor average, 
f (0) = 0.9704+0.0032, of the (2+1)-flavor lattice calculations [138] is in good 
agreement with a classical calculation [139]. With this value one obtains, 


|Vus| = 0.2231 + 0.0008 . 


The lattice calculation of the ratio of kaon and pion decay constant, fx/fr = 
1.1933 + 0.0029 [138], allows the determination of the ratio |V,,/ V,4| from 
K — uv and z — uv decays. Using the precise measurement of the K > uv 
branching fraction by the KLOE collaboration [140] results in | V,,5| = 0.2253 + 
0.0007. 
An alternative determination of |V,,,| uses hadronic t decays to strange hadrons. 
The average of the measured inclusive and exclusive branching fractions yields 
|Vus| = 0.2216 + 0.0015 [134]. 

3. |V44|. First determinations of |V-q| came from neutrino scattering experiments. 
The difference of the ratio of double-muon to single-muon production for 
neutrino and antineutrino scattering is depending on the charm production cross 
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section and thus on |V.4|?, as well as on the semileptonic (muonic) branching 
ratio of the produced charm mesons, B The method was first used by the 
CDHS group [141] but has also been applied by CCFR [142] and by CHARM 
II [143]. Averaging the results is complicated, also because B, is an effective 
quantity which depends on the specific neutrino beam characteristics. One finds 
BalVsal = (0.463 + 0.034) x 107? [144] and using the average value of 
B, = 0.087 + 0.005 one obtains [133] 


Vea] = 0.230 + 0.011. 


Similar to |V,,], |V.4| can also be extracted from semileptonic (D — 7x £v) and 
leptonic (Dt — y*v) charm decays. Also here, QCD lattice calculations are 
used to determine the relevant form factors f 2? g (q? — 0) and fp [138]. Using 
the average of the branching fraction measurements from BABAR, Belle, BES 
III and CLEO-c for D > x £v [134] results into [133] 


| Vc4| = 0.2140 + 0.0029 + 0.0093, 


where the first uncertainty is experimental and the second stems from the 
theoretical uncertainty of the form factor calculation. The measurements of the 
leptonic branching ratio Dt — u*v by BES III and a CLEO-c results into [134] 


|Vea| = 0.2164 + 0.0050 + 0.0015, 


where also here the first error is experimental while the second describes the 
uncertainty of the form factor calculation. For the average of the three different 
determinations of | Veq | the Particle Data Group [133] quotes a value of 


Vea] = 0.218 + 0.004 . 


4. |Vcs|. Measurements of semileptonic decays of D mesons to kaons D — Kv 
as well as the measurement of leptonic decays of D, mesons, Dt — pry, 
together with the corresponding form factors from lattice calculations allow the 
determination of |V.,|. Branching fraction measurements have been performed 
by Belle, BABAR, CLEO-c and BES III, and are averaged in [134]. From the 
semileptonic measurements one obtains |Ves| = 0.967 + 0.025 where the error 
is dominated by the theoretical uncertainty of the form factor calculations. The 
average of the leptonic measurements results into |V.,| = 1.006 + 0.019 where 
the dominating uncertainty is experimental. For the average of both values the 
Particle Data Group [133] reports a values of 


|Ves| = 0.997 + 0.017. 
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5. |Vcp|. This matrix element is determined from semileptonic B decays to D 
or D* mesons. Two experimentally and theoretically different approaches are 
used. The inclusive approach measures the inclusive semileptonic decay rate to 
any charmed final state together with the moments of the leptonic energy and 
the hadronic invariant mass spectra. An operator product expansion within the 
Heavy Quark Effective Theory (HQET) allows the calculation of decay rates 
and the energy and mass spectra in dependence of expansion parameters a, and 
the inverse of the heavy quark mass. The simultaneous measurement of several 
distributions over-constrains the physical parameters and allows a determination 
of | Vep. An analysis and an averaging of existing measurements is performed in 
[145] and leads to |V| = (42.2 + 0.8) x 10? with uncertainties arising mainly 
from higher-order perturbative and non-perturbative corrections. 

In the exclusive approach semileptonic B decays to exclusive channels containing 
D and D* mesons are studied. In the infinite quark mass limit with mp , mc >> 
Agcp, heavy quark symmetry predicts that all form-factors are given by a 
single function which depends on the product of the four-velocities v of the 
B and v' of the D® (mesons) and are normalized at the point of maximum 
momentum transfer to the lepton system (v - v! = 1). The matrix element 
|V.p| is obtained from an extrapolation to this point. The precise determination 
of normalization and the shape of the form factor function requires additional 
corrections calculated using HQET. Reference [145] quotes |V.,| = (41.9 4 
2.0) x 10? as the average exclusive value of |V;; |. 

The inclusive and the exclusive determination of | Vep| are in agreement and [145] 
performs the average of both values: 


|Vep| = (42.2 + 0.8) x 10°. 


6. |Vub|. Similar to | Vcp| also | V,,5| is determined analysing inclusive and exclusive 
semileptonic B decays with a b — u£v transition. The determination of |V,,j| 
from inclusive decays however suffers experimental and theoretical difficulties. 
The total inclusive decay rate is hard to measure due to the large background 
from CKM-favored b — cé€v transitions. Therefore, strong kinematic cuts 
are introduced to suppress these background contributions. The restriction to 
tight kinematic regions however complicates the theoretical description. The 
calculation of partial decay rates in the various kinematic regions requires 
the introduction of non-perturbative distribution functions—the so called shape 
functions—to describe the effect of hadronic physics. At leading order there is 
only a single shape function which can be determined using inclusive B — X;y 
decays. Subleading effects are considered using different theoretical models 
based on Heavy Quark Expansion (HQE). A recent summary of the |V,,,| values 
extracted within different models from measurements by BABAR, BELLE and 
CLEO can be found in [145]. All calculations give similar values for |V,,,| and 
similar error estimates. As average [145] quotes 


[Viol = (4.49 + 0.15*015) x 107°. 
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To consider different theoretical treatments the authors assign an additional 
quadratic error of +0.17 x 107°. 

The determination of |V,,,| from exclusive decays, such as B — mév decays, 
suffers experimentally from very small signal yields and requires the theoretical 
determination of the corresponding form factors. Lattice form factor calculations 
are available for the high q? regions. So called light-cone QCD sum rules (LCSR) 
are applicable for the low q? region. A simultaneous fit to experimental B — 
mv data and lattice results as function of q? together with additional constraints 
from LCSR results into 


|Vu5| = (3.67 + 0.09 + 0.12) x 107? [134]. 
The inclusive and exclusive determinations of |V„p| are largely independent and 
the large discrepancy between both methods remains a puzzle. Inflating the errors 


to account for this discrepancy [145] quotes an average of 


|V,5| = (3.94 + 0.36) x 1077 


The LHCb experiment has used the ratio of the two baryonic decays Ap — 
pe v and Ap > Ac v to extract the ratio | Vp / V.5| = 0.083 + 0.006 [146]. 
The q?-dependent form factor ratio had to be taken into account to consider the 
different kinematical ranges of the two decays. Using the above average for | Vep| 
one obtains 


|V,5| = (3.50 + 0.26) x 1077. 


7. |V|. An experimental determination of |V;,| without assuming unitarity is 
possible using the production of single-top quarks. Single-top quark production 
cross-sections have been measured by the Tevatron experiments CDF and DO, 
and at LHC by ATLAS and CMS. Using these measurements [133] quotes the 
value 


[Vip] = 1.019 + 0.025. 


8. |V;4| and |V;s|. The two CKM elements |V;4| and |V;,| are expected to be very 
tiny and tree-level decays of the top-quark to a d-quark or s-quark will be 
very difficult to measure. However, both elements are accessible through the 
measurements of the mixing frequency of B? and B? mesons (see Sect. 9.6). 


Using the most recent lattice QCD results for the hadronic factors, fp, Bp = 


(219+ 14) MeV and fp, ,/ B, = (270+ 16) MeV [138], together with the world 
averages for the mixing frequencies Am and Am; results into [133] 


|V,a| = (8.1 + 0.5) x 107? and 


lVis| = (39.4 + 2.3) x 1077, 
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where the uncertainties are dominated by the lattice uncertainties of the hadronic 
factors. Several uncertainties are reduced when calculating the ratio 


£= (ron) y (mo Bn) = 1.239 + 0.046 . 


The ratio |V;4|/|V;;| is therefore stronger constraint [133], 


|Via|/|Vis| = 0.210 + 0.001 + 0.008 . 


Using the independent measurements of the CKM elements the unitarity of the 
CKM matrix can be checked. One obtains for the first two rows [133], 


Veal? + |Vus|? + |Vup|? = 0.9994 + 0.0005 , 
Vea] + |Ves|2 + | Va = 1.043 + 0.034 , 


which agrees well with the unitarity assumption. In addition, the direct measurement 
of | V;,| leaves little room for mixing of the top into unknown other states. 


9.7.3 Global Analysis and Test of the Unitarity of the CKM 
Matrix 


The available information on the magnitude and the phases of the CKM elements 
can be analysed by a global fit to all CKM parameters. The most precise information 
is obtained imposing Standard Model constraints such as the unitarity of the CKM 
matrix and the existence of exactly three quark generations. Input parameters of 
the global analysis are, beside the experimental measurements, also theoretically 
determined hadronic parameters with sometimes large errors. 

Different approaches exist to combine the experimental data and to treat the 
experimental and theoretical errors. The CKM-Fitter group [135, 147] is using 
a frequentist’s framework based on a x? analysis. In these fits the frequentist’s 
treatment is also applied to the theoretical errors. The UTfit group [148] uses a 
baysian approach for all errors. The two different statistical approaches lead to very 
similar results and here only the results of [135] are presented. 

Figure 9.39 shows the experimental constraints in the (o, 7) plane. Indicated 
is the unitarity triangle with its angles a, 6 and y. The different measurements 
clearly limit the apex of the triangle to a small (dashed) region. For the Wolfenstein 
parameters introduced in Eq. (9.217) the global fit [133, 135] gives 


A = 0.22453 + 0.00044 , A = 0.836 + 0.015, (9.230) 
p = 0.122100, z= 0.355+0917 , (9.231) 
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Fig. 9.39 Experimental constraints of the unitarity triangle. Shaded areas correspond to 95% C.L. 
Figure taken from [133] 


Alternatively one can give the fit results of the magnitudes of all nine elements of 
the CKM matrix: 


0.97446 + 0.00010 0.22452 + 0.00044 0.00365 + 0.00012 
VckM = | 0.22438 + 0.00044 0.973591000010 — — 0.04214 + 0.00076 


0.00896*000022 — 0.04133 + 0.00074 — 0.999105 + 0.000032 
(9.232) 


Figure 9.39 represents an impressive confirmation of the CKM paradigm which 
describes the flavor transition and the CP violation in the quark sector. The global 
fit currently does not point to deviations from the Standard Model picture. However, 
the effect of New Physics might be small. Over-constraining measurements of CP 
asymmetries, mixing and flavor changing decays will put further bounds on possible 
New Physics contributions. 
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9.8 Conclusion 


Nearly 40 years after the discovery of CP violation, the nature of the phenomenon 
has been clarified experimentally. In the K meson system, C7 violation has been 
discovered in the mixing (£), in the decay (e’/e, direct CP violation), and in the 
interference between mixing and decay (Sm £); in the B meson system, CP violation 
in the interference between mixing and decay (sin 28) as well as CP violation in the 
decay have been observed. While finishing this article, the LHCb collaboration has 
also reported the first observation of CP violation in the decay of neutral D mesons 
to K* K- and rtr” (O(1.5 x 1075)) [205]. All observations are consistent? with 
the model of Kobayashi and Maskawa where the 3 x 3 mixing matrix of six quarks 
has one non-trivial complex phase ô = (71.0 + 0.3)°. This leads to complex weak 
coupling constants of quarks, and to 7 and CP violation. 

In cosmology, CP violation together with a large departure from thermal 
equilibrium via a first-order electroweak phase transition [206] and a baryon number 
violation by instanton processes [207] could explain the observed baryon asymme- 
try. The source would be the asymmetric interactions of quarks and antiquarks with 
the Higgs field [208]. The size of the observed CP violation in the quark sector, 
expressed by the Jarlskog determinant, however is insufficient to explain the matter 
asymmetry by several orders of magnitude [209]. Moreover, the low mass of the 
Higgs boson disfavours baryogenesis during the electroweak phase transition [210]. 
A way out of this dilemma is the hypothesis that CP violation in the neutrino sector 
could cause a lepton asymmetry. ( B4-L)-violating processes before the electroweak 
phase transition could then convert the lepton asymmetry into a baryon asymmetry. 
Leptogenesis would proceed through the production of heavy Majorana neutrinos in 
the early Universe [211]. While the experimental proof of leptogenesis is difficult, 
the observation of CP violation in the neutrino sector and the exploration of the 
nature of the neutrino (Dirac or Majorana fermion) could establish strong hints. 

In quark flavor physics the focus has changed after the successful establishment 
of the Kobayashi-Maskawa paradigm. Today, the precise measurement of flavor 
changing processes and CP asymmetries are used as tools to test the Standard 
Model predictions. Tiny deviations between observations and theory predictions 
could hint to additional quantum corrections modifying the size and the phase of 
flavor changing amplitudes. The origin of these additional corrections could be 
new heavy particles at mass scales much higher than the energies accessible by 
today's collider experiments. Experimental results on neutral meson mixing already 
constrain the ratio between mass scale and the coupling strength of new particles to 
values above 10? to 10* TeV, where the strongest bounds come from the neutral K 
meson system [212]. The next generation of precision quark-flavor experiments will 


8For the D mesons the theoretical predictions still have large uncertainties. For K mesons recent 
theoretical results on the value of the CP violating amplitude Ñ G Je) both from dual QCD [213] 
and from lattice QCD [214] deviate from the experimental value reported in Sect. 9.5, thus further 
clarification is needed. 
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thu 


s open a window to New Physics complementary to the direct searches at highest 


energies. 
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Chapter 10 A 
The Future of Particle Physics: The LHC ee 
and Beyond 


Ken Peach 


10.1 2017 Update 


I have been asked to submit a revised version of this chapter, published almost a 
decade ago. However, I think that it is better to leave the historical record as it 
was—this was an article written in its time and for its time. If I was writing this 
article today, I would call it “The Future of Particle Physics—Beyond the LHC”, 
in recognition of the fact that, when originally written, the LHC was still under 
construction and now it has been operating for several years. The other key event 
which informed the original article was the recently-developed European Strategy 
for Particle Physics, adopted by the CERN Council in July 2006; the strategy was 
updated in 2013 and formally adopted in May of that year [1]; as I write, the process 
of updating the strategy is under way. 

The intervening decade has seen significant progress, most spectacularly in the 
observation of the Higgs scalar [3] at 125.10 + 0.14 GeV/c? [2] which completes 
the Standard Model of the particles and their interactions (excluding gravity) which 
dominate the local region of the universe. This means that all of the free parameters 
of the Standard Model are now known with reasonable precision, which can be 
chosen as (see Table 10.1): 


. The quark masses (my, Ma), (mc, ms), (Mt, Mp); 
. the charged lepton masses me, Mp, Mz; 

. the Z? mass Mz; 

. the Higgs Mass My; 

. the electromagnetic coupling constant a; 

. the strong coupling constant a.(Mz); 
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Table 10.1 Parameters of the Standard Model [2] 


My Me m, 

2.14*02 MeV/c? | |1.27 € 0.02 GeV/c? 172.9 + 0.4 GeV/c? 

Mq Ms mp 

4.67*015 MeV/c? | 9345! MeV/c? 4.18*009 GeV/c? 

Me My mr 

0.5109989461(31)  |105.6583745(24) 1776.86(0.12) MeV/c? 

MeV/c? MeV/c? 

Mz Mu 

91.1876 + 0.0021 | 125.10 + 0.14 GeV/c? 

GeV/c? 

a GF os (Mz) 

7.2973525664(17) | 1.1663787(6) x 10-5 — |0.1179(10) 

x 10? GeV? 

X A p n 

0.22453 + 0.00044 | 0.836 + 0.015 0.122 + 0.0175 0.355 + 0.0115 

Derived quantities sin?0w Mw = M; cos 0,, sin?20,, = ur 
0.23129 + 0.00005 80.385 + 0.015 GeV/c? | wae 


7. the weak coupling constant Gp; and 
8. the Cabibbo-Kobayashi-Maskawa (CKM) parameters in the Wolfenstein 
parametrization X, A, p and n. 
In addition, two other useful quantities are given—Myw and sin20w—which 
can be written in terms of the other parameters, as shown in the last line of 
Table 10.1. 


This is a total of 18 parameters which can only (so far) be determined by 
experiment. However, once determined, they describe the observed interactions with 
remarkable precision; for example, the muon anomalous magnetic moment (aver- 
aged between p+ and jL—) is (11,659,209 1 + 54s; + 33syst) x 107!!, to be com- 
pared with the theoretical expectation of (11,659,180 3 + lew + 49pgaa) x 107!!, 
where the first error in the theoretical calculation comes from electroweak correc- 
tions and the second comes from a combination of low-order (42) and higher order 
(26) hadronic corrections [4]. The difference between these is (288 + 80) x 10-!! 
(3.5 standard deviations) and it is, of course, hotly debated whether this is 
statistically significant and, if it is, what that significance might be. There is a 
new experiment at Fermilab [5] which aims to reduce the total error in the muon 
anomalous magnetic moment to 14 x 107!!, which, even without improvements to 
the theoretical calculations, would (if the central value of the discrepancy remained 
unchanged) increase the significance to more than 5 standard deviations. 

Within the Standard Model, because the neutrinos have only one helicity state, 
they must be strictly massless. However, the phenomenon of neutrino oscillations, 
now well established, requires the neutrinos to have mass—essentially, the oscilla- 
tion is a beating in the propagation through time of the different mass eigenstates 
that form the flavour eigenstates. The past decade has seen enormous progress in 
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Table 10.2 Neutrino Oscillation Parameters [6, 7]* 


Normal hierarchy Inverted hierarchy 

Am‘, | Am}, | 
7.371020 x 1075 eV?/ct 2.50 + 0.04 x 1073 eV7/c# 2.46 + 0.04 x 107° eV7/c* 

0.14 
Sin?015 Sin^65 

4-0.019 +0.060 3-0.023 
0.297 5016 0.439" y oig 0.569" y 060 
2m (Ami) SE 0.0011 0.0010 

+0. +0. 

? 0.0214 0:0010 0.0218 9591 
Xm, S/n 
« 0.183 eV (95% CL) 15555 1t 


“The 3-sigma limits have been converted into 1-sigma asymmetric errors 


the determination of the neutrino oscillation parameters (see Table 10.2). Although 
the mathematics of the flavour mixing is similar in the quark and neutrino sectors, 
the phenomenology is very different. In the quark sector, the flavour oscillations— 
described by the CKM matrix—take place in either mixing or decay of bound 
states. In the neutrino sector—described by the Pontecorvo-Maki-Nakagawa-Sakata 
(PMNS) matrix—the flavour oscillations take place in vacuum, although there 
is an additional term for neutrino transmission in matter to take account of the 
different interaction cross-sections of the electron neutrinos, and the muon and 
tau neutrinos (and, of course, their antineutrinos). The main uncertainty is whether 
the third neutrino mass is heavier than the other two (normal hierarchy) or lighter 
(inverted hierarchy), with an emerging preference [8] for normal hierarchy. Most 
impressive is the evidence that the CP-violating phase angle in the PMNS matrix 
(8) is non-zero —0/2x is excluded at the 3-standard deviation level, and x at 2.5 
(1.5) standard deviations for the normal (inverted) hierarchy. The best limits on 
the absolute neutrino mass scale come from cosmology, giving an upper limit on 
Xm, of 0.183 eV at the 95% confidence level [7]. There are essentially two ideas 
for extending the Standard Model to include finite neutrino masses. The first notes 
that, since the neutrino has no electric charge, it is possible to construct a mass- 
like term from the left-handed neutrino and right-handed-antineutrino fields, which 
changes its nature to that of a Majorana particle [9]. (If the neutrinos are indeed 
Majorana particles, there are two additional phase angles (o and B) in the PMNS 
matrix which are, however, extremely difficult to measure.) The second postulates 
the existence of a heavy right-handed neutrino which, in combination with a very 
high mass scale (like the GUT scale), produces through the “see-saw” mechanism 
one light left-handed neutrino («« 1 eV/c?) and a heavy right-handed neutrino 
(>> 10!* GeV/c?)—as the left-handed neutrino becomes lighter the right-handed 
neutrino becomes heavier through the “see-saw”. One consequence of these is that 
there are interactions that allow a change of lepton number by 2 units, thus enabling 
neutrinoless double-B decay to occur. (For a recent review, see [10].) 

The other area where the Standard Model obviously fails is in the Dark Sector— 
Dark Matter and Dark Energy. The cosmological and astronomical evidence for 
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these is overwhelming (see reference [11] for a recent review) with a recipe for 
the current state of the Universe containing about 70% of Dark Energy, 25% Dark 
Matter and 5% “normal” matter, the only part addressed by the Standard Model. 
From a particle physics perspective, Dark Matter is more tractable. First, there 
is the challenge of directly detecting it using large volume, sensitive detectors 
deep underground. The past decade has seen the limits steadily improved (see 
reference [12] for a recent review), while the small number of claimed signals 
have still not been independently confirmed. There is an ongoing programme of 
experiments at the major underground laboratories which should improve these 
limits by several order of magnitude over the next few years. Second, there are 
straightforward extensions of the Standard Model which provide candidates for 
Dark Matter. Unfortunately, there is no reliable estimate for the mass scale at which 
these new particles might appear. Such evidence might come from the Dark Matter 
searches themselves. The other place where a mass scale might be forthcoming is in 
the analysis of the deficiencies of the Standard Model. For example, Supersymmetry 
was proposed to address some hierarchy problems within the model, and also 
to assist in Grand Unification, both of which pointed to a TeV energy scale for 
appearance of new particles. However, so far these have failed to materialize at 
the LHC; unfortunately, the parameter space for Supersymmetry is extensive and 
so, while this might be disappointing it may not be conclusive. Dark Energy, on the 
other hand, is still mysterious and itis unclear where it fits in to the global picture; its 
understanding might require the reconciliation of quantum mechanics with general 
relativity. 

The “open questions within the Standard Model” identified in the original article 
have been largely resolved, although there are still details to be addressed, for exam- 
ple the transition from perturbative to non-perturbative QCD. However, the “open 
questions beyond the Standard Model” remain, refined somewhat; for example, the 
“pattern of fermion masses” might nowadays be cast as “understanding the Yukawa 
couplings of the fundamental fermions to the Higgs". The improvements in the 
knowledge of the neutrino oscillation parameters do not explain how the mixing 
comes about, nor how the neutrinos derive their (tiny) masses. Again, there has 
been much progress in the measurement of the various cosmological parameters, 
including those which either determine or are determined by particle physics, but 
the “open questions in particle physics and cosmology” are still far from resolved. 
For example, the arguments for Grand Unification of the strong and electroweak 
forces remain intact (see Fig. 10.1) even though there is, as yet, no direct evidence 
for the existence of Supersymmetric particles. A second area which points to new 
physics is to note that, of the 18 parameters that define the Standard Model, 11 
are directly related to mass (the quarks, the charged leptons, the Z and the Higgs), 
which becomes 12 if the Fermi constant is replaced (as it can be) by the mass of 
the W. Given this, there are three mass hierarchies to be explained. First, within a 
generation, what determines the relative masses of the charged lepton, the down- 
type quark (charge —1/3) and the up-type quark (charge +2/3) (and why is the 
up-quark lighter than the down quark, while the charm and top quarks are heavier 
than the strange and bottom quarks)? Second, what determines the large (several 
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Fig. 10.1 The evolution of the fundamental forces as a function of energy in the standard model 
(left) and a particular set of parameters in the Minimal Supersymmetric Standard Model (right). 
(Adapted from [13]) 


orders of magnitude) mass differences between the generations of the same type of 
fundamental fermion. Third, what determines the masses of the heavy field bosons 
(Higgs and the W and Z)? (The other gauge bosons—the photon and the gluon— 
remain massless, protected by an unbroken gauge symmetry.) The origin of the 
mixing matrix and CP-violation in the quark sector is still unresolved, as are the 
masses and oscillation parameters in the neutrino sector. 

The detection of Gravitational Waves from the merger of two black holes [14] 
and the subsequent detection of gravitational waves from a binary neutron star 
inspiral [15] open the door to a new era of observational astronomy of matter under 
extreme conditions, and may also provide insight into the long-standing issue of 
the incompatibility of general relativity with quantum mechanics. Resolving this is 
likely to require understanding of physics at the Planck scale. 

Given this, the main conclusions of the “Way Forward" are largely unchanged. 
Clues to the physics that must exist beyond the Standard Model could come directly 
from the observation of new states of matter (e.g. Supersymmetry), either at the LHC 
or at a new energy frontier machine, or indirectly from precision measurements 
of Standard Model parameters, for example at a Linear Collider built to produce 
large numbers of Higgs particles under clean conditions. Alternatively, ultra- 
high precision measurements of low energy parameters (g-2, |. — e conversion, 
neutrinoless double B decay...) as well as new astrophysical observations (Dark 
Matter, gravitational waves) could give strong pointers toward the new physics. 
The tremendous progress in the neutrino sector from, particularly, T2K and Nova 
probably means that the arguments in favour of a neutrino factory have weakened 
somewhat, but the arguments for the other facilities discussed (including the 
luminosity upgrade for the LHC) remain and have strengthened. 

A recent publication by the T2K collaboration (Nature 580, 339—344 (2020)) 
reports that “The 3o confidence interval for dcp, which is cyclic and repeats every 
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2€, is [—3.41, —0.03] for the so-called normal mass ordering and [—2.54, —0.32] 
for the inverted mass ordering". 
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or OpenStack swift? Object Storage. Therefore, a likely scenario is that current 
storage access techniques used by particle physics will become unsupported in the 
future. Of course, virtualization and Cloud techniques will make it possible for the 
particle physics community to deploy these legacy technologies themselves, with 
the corresponding expenses in personnel. A better way forward is to collaborate 
with industry, and between various Resource Centers, to test alternative tools and 
methods. Realistic evaluations are far from trivial, however, as they may require 
substantial investments by all parties, for example to provide UMD Grid or Cloud 
interfaces. 

Another looming issue is the efficiency of use of the CPUs under data-intensive 
conditions. One basic assumption of the distributed architecture used by the particle 
physics community, as described above, is the existence of an infinitely powerful 
network connecting all elements, which in effect assumes infinitely powerful 
data servers as well as data ingestion by CPUs. Ever increasing data processing 
requirements and more and more powerful multi-core CPU nodes, however, are 
revealing data serving performance bottlenecks in clusters coming from the network 
and disk and CPU server limitations. Removing these bottlenecks may require more 
sophisticated architectures to be deployed. 

A much more severe issue is related to the simplistic, often naive, manner 
in which large particle physics projects pretend to use Grids and Clouds. They 
essentially desire them to behave as a very large cluster perfectly tuned to their 
needs, ignoring that imposing unreasonably high requirements in peak network 
and data serving rates within and between clusters can be very expensive. A Grid 
would better be viewed as a loosely connected federation of largely autonomous, 
self-sufficient clusters, with some of these clusters possibly hosted on commercial 
services. 

These and other future challenges are being addressed in a coherent manner by 
the particle physics community through the HEP Software Foundation?^?* and will 
certainly require a vigorous new cycle of research and development, in collaboration 
with computer scientists and engineers, the supercomputing and HPC communities 
and industry. 
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1712.06982. Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States); Brookhaven 
National Laboratory (BNL), Upton, NY (United States); Lawrence Berkeley National Lab. 
(LBNL), Berkeley, CA (United States); SLAC National Accelerator Lab., Menlo Park, CA (United 
States); Thomas Jefferson National Accelerator Facility (TJNAF), Newport News, VA (United 
States); Argonne National Lab. (ANL), Argonne, IL (United States), 2017. 

8Slides from the Computing in High Energy Physics 2018 conference (proceedings to 
be published) https://indico.cern.ch/event/587955/contributions/3012294/attachments/1681524/ 
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Chapter 15 ff) 
Statistical Issues in Particle Physics PEE 


Louis Lyons 


15.1 Introduction 


In recent years there has been a growing awareness by particle physicists of the 
desirability of using good statistical practice. This is because the accelerator and 
detector facilities have become so complex and expensive, and involve so much 
physicist effort to build, test and run, that it is clearly important to treat the data with 
respect, and to extract the maximum information from them. The PHYSTAT series 
of Workshops and Conferences[1—12] has been devoted specifically to statistical 
issues in particle physics and neighbouring fields, and many interesting articles can 
be found in the relevant Proceedings. These meetings have benefited enormously 
from the involvement of professional statisticians, who have been able to provide 
specific advice as well as pointing us to some techniques which had not yet filtered 
down to Particle Physics analyses. 

Analyses of experimental data in Particle Physics have, perhaps not surprisingly, 
tended to use statistical methods that have been described by other Particle Physi- 
cists. There are thus several books written on the subject by Particle Physicists[13]. 
The Review of Particle Physics properties[14] contains a condensed review of 
Statistics. 

Another source of useful information is provided by the statistics committees 
set up by some of the large collaborations (see, for example, refs. [15-18]). Some 
conferences now include plenary talks specifically on relevant statistical issues 
(for example, Neutrino 2017[19], NuPhys17 and NuPhys18[20]), and the CERN 
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Summer Schools for graduate students regularly have a series of lectures on statistics 
for Particle Physics[2 1]. 

This article is a slightly updated version of the one that appeared in ref. [22] in 
2012. 


15.1.1 Types of Statistical Analysis 


There are several different types of statistical procedures employed by Particle 
Physicists: 


* Separating signal from background: Almost every Particle Physics analysis 
uses some method to enhance the possible signal with respect to uninteresting 
background. 

* Parameter determination: Many analyses make use of some theoretical or 
empirical model, and use the data to determine values of parameters, and their 
uncertainties and possible correlations. 

e Goodness of fit: Here the data are compared with a particular hypothesis, often 
involving free parameters, to check their degree of consistency. 

* Comparing hypotheses: The data are used to see which of two hypotheses is 
favoured. These could be the Standard Model (SM), and some specific version of 
new physics such as the existence of SUperS Ymmetry (SUSY), or the discovery 
of the Higgs boson[23]. 

* Decision making: Based on one's belief about the current state of physics, the 
value of possible discoveries and estimates of the difficulty of future experiments, 
a decision is made on what should be thrust of future research. This subject is 
beyond the scope of this article. 


15.1.2 Statistical and Systematic Uncertainties 


In general any attempt to measure a physics parameter will be affected by statistical 
and by systematic uncertainties. The former are such that, if the experiment were 
to be repeated, random effects would result in a distribution of results being 
obtained. These can include effects due to the limited accuracy of the measurement 
devices and/or the experimentalist; and also from the inherent Poisson variability 
of observing a number of counts n. On the other hand, there can be effects that 
shift the measurements from their true values, and which need to be corrected for; 
uncertainties in these corrections contribute to the systematics. Another systematic 
effect could arise from uncertainties in theoretical models which are used to interpret 
the data. Scientists’ systematics are often ‘nuisance parameters’ for statisticians. 
Consider an experiment designed to measure the temperature at the centre of 
the sun by measuring the flux of solar neutrinos on earth. The main statistical 
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uncertainty might well be that due to the limited number of neutrino interactions 
observed in the detector. On the other hand, there are likely to be systematics from 
limited knowledge of neutrino cross-sections in the detector material, the energy 
calibration of the detector, neutrino oscillation parameters, models of energy con- 
vection in the sun, etc. If some calibration measurement or subsidiary experiment 
can be performed, this effectively converts a systematic uncertainty into a statistical 
one. Whether this source of uncertainty is quoted as statistical or systematic is 
not crucial; what is important is that possible sources of correlation between 
uncertainties here and in other measurements (in this or in other experiments) are 
well understood. 

The magnitude of systematic effects in a parameter-determination situation can 
be assessed by fitting the data with different values of the nuisance parameter(s), 
and seeing how much the result changes! when the nuisance parameter value is 
varied by its uncertainty. Alternatively the nuisance parameter(s) for systematic 
effects can be incorporated into the likelihood or x? for the fit; or a Bayesian method 
involving the prior probability distribution for the nuisance parameter can be used. 
(See Sects. 15.4.5 and 15.7.6 for ways of incorporating nuisance parameters in upper 
limit and in p-value calculations respectively). 

How to assess systematics was much discussed at the first Banff meeting[6] and 
at PHYSTAT-LHC[24—26]. A special session of the recent PHYSTATv meeting at 
CERN[12] was devoted to systematics. Many reviews of this complex subject exist 
and can be traced back via ref. [27]. 

In general, much more effort is involved in estimating systematic uncertainties 
than for parameter determination and the corresponding statistical uncertainties; this 
is especially the case when the systematics dominate the statistical uncertainty. 

Cowan[35] has considered the effect of having an uncertainty in magnitude of a 
systematic effect. As Cox has remarked[36], there is a difference in knowing that a 
correction has almost precisely a 20% uncertainty, or that it is somewhere between 
0% and 40%. 


15.1.3 Bayes and Frequentism 


These are two fundamental approaches to making inferences about parameters or 
whether data support particular hypotheses. There are also other methods which do 
not correspond to either of these philosophies; the use of x? or the likelihood are 
examples. 

Particle physicists tend to favour a frequentist method. This is because in many 
cases we really believe that our data are representative as samples drawn according 
to the model we are using (decay time distributions often are exponential; the counts 


'Tf the simulation yields a change in the result of a + b, there is much discussion about how the 
contribution to the systematic uncertainty should be assessed in terms of a and b—see ref. [27]. 
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in repeated time intervals do follow a Poisson distribution; etc.), and hence we 
want to use a statistical approach that allows the data “to speak for themselves”, 
rather than our analysis being sensitive to our assumptions and beliefs, as embodied 
in the assumed Bayesian priors. Bayesians would counter this by remarking that 
frequentist inference can depend on the reference ensemble, the ordering rule, the 
stopping rule, etc. 

With enough data, the results of Bayesian and frequentist approaches usually 
tend to agree. However, in smallish data samples numerical results from the two 
approaches can differ. 


15.1.3.1 Probability 


There are at least three different approaches to the question of what probability is. 
The first is the mathematical one, which is based on axioms e.g. it must lie in the 
range 0-1; the probabilities of an event occurring and of it not occurring add up to 
1; etc. It does not give much feeling for what probability is, but it does provide the 
underpinning for the next two methods. 

Frequentists, not surprisingly, define probability in terms of frequencies in a 
long series of essentially identical repetitions” of the relevant procedure. Thus the 
probability of the number 5 being uppermost in throws of a die is 1/6, because that 
is the fraction of times we expect (or approximately observe) it to happen. This 
implies that probability cannot be defined for a specific occurrence (Will the first 
astronaut who lands on Mars return to earth alive?) or for the value of a physical 
constant (Does Dark Matter contribute more than 25% of the critical density of the 
Universe?). 

In contrast, Bayesians define probability in terms of degree of belief. Thus 
it can be used for unique events or for the values of physical constants. It can 
also vary from person to person, because my information may differ from yours. 
The numerical value of the probability to be assigned to a particular statement is 
determined by the concept of a ‘fair bet’; if I think the probability (or “Bayesian 
credibility’) of the statement being true is 20%, then I must offer odds of 4-to-1, and 
allow you to bet in either direction. 

This difference in approach to probability affects the way Bayesians and 
frequentists deal with statistical procedures. This is illustrated below by considering 
parameter determination. 


15.1.3.2 Bayesian Approach 


The Bayesian approach makes use of Bayes’ Theorem: 


p(A|B) = p(GB|A) x p(A)/p(B), (15.1) 


Bayesians attack this concept of ‘essentially identical trials’, claiming that it is hard to define it 
without using the concept of probability, thus making the definition circular. 
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where p(A) is the probability or probability density of A, and p(A|B) is the 
conditional probability for A, given that B has happened. This formula is acceptable 
to frequentists, provided the probablities are frequentist probabilities. However 
Bayesians use it with A = parameter (or hypothesis) and B = data. Then 


p(parameter |data) x p(data| parameter) x p(parameter), (15.2) 


where the three terms are respectively the Bayesian posterior, the likelihood function 
and the Bayesian prior. Thus Bayes’ theorem enables us to use the data (as 
encapsulated in the likelihood) to update our prior knowledge (p(parameter)); the 
combined information is given by the posterior. 

Frequentists object to the use of probability for physical parameters. Further- 
more, even Bayesians agree that it is often hard to specify a sensible prior. For a 
parameter which has been well determined in the past, a prior might be a gamma 
function or log-normal or a (possibly truncated) Gaussian distribution of appropriate 
central value and width, but for the case where no useful information is available the 
choice is not so clear; it is easier to parametrise prior knowledge than to quantify 
prior ignorance. The ‘obvious’ choice of a uniform distribution has the problem of 
being not unique (Should our lack of knowledge concerning, for example, the mass 
of a neutrino m, be parametrised by a uniform prior for m, or for m? or for log my, 
etc?). Also a uniform prior over an infinite parameter range cannot be normalised. 
For situations involving several parameters, the choice of prior becomes even more 
problematic. 

It is important to check that conclusions about possible parameter ranges are 
not dominated by the choice of prior. This can be achieved by changing to other 
‘reasonable’ priors (sensitivity analysis); or by looking at the posterior when the 
data has been removed. 


15.1.3.3 Frequentist Approach: Neyman Construction 


The frequentist way of constructing intervals completely eliminates the need for 
a prior, and avoids considering probability distributions for parameters. Consider a 
measurement x which provides information concerning a parameter u. For example, 
we could use a month's data from a large solar neutrino detector (x) to estimate the 
temperature at the centre of the sun (jz). It is assumed that enough is known about 
solar physics, fusion reactions, neutrino properties, the behaviour of the detector, 
etc. that, for any given value of jz, the probability density for every x is calculable. 
Then for that u, we can select a region in x which contains, say, 90% of this 
probability. If we do this for every u, we obtain a 90% confidence band; it shows 
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X% X X; 


Fig. 15.1 The Neyman construction for setting a confidence range on a parameter u. At any value 
of u, it is assumed that we know the probability density for obtaining a measured value x. (For 
example, u could be the temperature of the fusion reactor at the centre of the Sun, while o is the 
solar neutrio flux, estimated by operating a large underground solar neutrino detector for 1 month.) 
We can then choose a region in x which contains, say, 90% of the probability; this is denoted by the 
solid part of the horizontal line. By repeating this procedure for all possible u, the band between 
the curved lines is constructed. This confidence band contains the likely values of x for any jz. For 
a particular measured value x2, the confidence interval from u; to 4, gives the range of parameter 
values for which that measured value was likely. For x2, this interval would be two-sided, while 
for a lower value xı, an upper limit would be obtained. In contrast, there are no parameter values 
for which xo is likely, and for that measured value the confidence interval would be empty 


the values of x which are likely results? of the experiment for any u, assuming the 
theory is correct (see Fig. 15.1). Then if the actual experiment gives a measurement 
x2, it is merely necessary to find the values of u for which x» is in the confidence 
band. This is the Neyman construction. 

Of course, the choice of a region in x to contain 9096 of the probability is not 
unique. The one shown in Fig. 15.1 is a central one, with 5% of the probability on 
either side of the selected region. Another possibility would be to have a region with 
10% of the probability to the left, and then the region in x extends up to infinity. This 
choice would be appropriate if we always wanted to quote upper limits on jz. Other 
choices of ‘ordering rule’ are also possible (see, for example, Sect. 15.4.3). 

The Neyman construction can be extended to more parameters and measure- 
ments, but in practice it is very hard to use it when more than two or three 
parameters are involved; software to perform a Neyman construction efficiently in 
several dimensions would be very welcome. The choice of ordering rule is also 
very important. Thus from a pragmatic point of view, even ardent frequentists 


3The adjective ‘likely’ is appropriate for central intervals. For upper limits on jz, however, the 
accepted values of x for a given jz extend to infinity, and so ‘preferred results for the given ordering 
rule’ would be more appropriate. 
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are prepared to use Bayesian techniques for multidimensional problems (e.g. with 
systematics). They would, however, like to ensure that the technique they use 
provides parameter intervals with reasonable frequentist coverage. 


15.1.3.4 Coverage 


One of the major advantages of the frequentist Neyman construction is that it 
guarantees coverage. This is a property of a statistical technique^ for calculating 
intervals, and specifies how often the interval contains the true value u; of the 
parameter. This can vary with uz. 

For example, for a Poisson counting experiment with parameter jz and observed 
number n, a (not very good) method for providing an interval for u is n + y/n. Thus 
an observed n = 2 would give a range 0.59-3.41 for u. If u = 2.01, observed 
values n = 2, 3 and 4 result in intervals that include u = 2.01, while other values 
of n do not. The coverage of this procedure for u = 2.01 is thus the sum of the 
Poisson probabilities for having n = 2, 3 or 4 for the given u. 

For a discrete observable (e.g. the number of detected events in a search for Dark 
Matter), there are jumps in the coverage; in order to avoid under-coverage, there 
is necessarily some over-coverage. However, for a continuous observable (e.g. the 
estimated mass of the Higgs boson) the coverage can be exact. 

Coverage is not guaranteed for methods that do not use the Neyman construction 
(see Sect. 15.2.1). Interesting plots of coverage as a function of the parameter value 
for the simple case of a Poisson counting experiment can be found in ref. [32]. 


15.1.3.5 Likelihoods 


The likelihood approach makes use of the probability density function (pdf) for 
observing the data, evaluated for the data actually observed.? It is a function of 
any parameters, although it does not behave like a probability density for them. It 
provides a method for determining values of parameters. These include point esti- 
mates for the ‘best’ values, and ranges (or contours in multi-parameter situations) 
to characterise the uncertainties. It usually has good properties asymptotically, but a 
major use is with sparse multi-dimensional data. 

The likelihood method is neither frequentist nor Bayesian. It thus does not 
guarantee frequentist coverage or Bayesian credibility. It does, however, play a 
central role in the Bayesian approach, which obtains the posterior probability 


^]t is important to realise that coverage is a property of the method, and not of an individual 
measurement. 

5The pdf f (x, mo) gives the probability density for obtaining various data x when the parameter 
has some specified value uo. The likelihood is the same function of two variables f (xo, jc), but 
now with xo fixed at the data actually obtained, and u regarded as the variable. 


652 L. Lyons 


density by multiplying the likelihood by the prior. The Bayesian approach thus 
obeys the likelihood principle, which states that the only way the experimental data 
affects inference is via the likelihood function. In contrast, the Neyman construction 
requires not only the likelihood for the actual data, but also for all possible data that 
might have been observed. 

Because the likelihood is not a probability density, it does not transform like one. 
Thus the value of the likelihood for a parameter mo is identical to that for A9 = 1/149. 
This means that ratios of likelihoods (or differences in their logarithms) are useful to 
consider, but that the integration of tails of likelihoods is not a recognised statistical 
procedure. 

A longer account of the Bayesian and frequentist approaches can be found in 
ref. [28]. Reference [29] provides a very readable account for a Poisson counting 
experiment. 


15.2 Likelihood Issues 


In this section, we discuss some potential misunderstandings of likelihoods. 


15.2.1  A(InL) = 0.5 Rule 


In the maximum likelihood approach to parameter determination, the best value 
Ao of a parameter is determined by finding where the likelihood maximises; and 
its uncertainty is estimated by finding how much the parameter must be changed? 
in order for the logarithm of the likelihood to decrease by 0.5 as compared with the 
maximum. From a frequentist viewpoint, this should ideally result in the parameter 
range having 6896 coverage. That is, in repeated use of this procedure to estimate 
the parameter, 6896 of the intervals should contain the true value of the parameter, 
whatever its true value happens to be. 

If the measurement is distributed about the true value as a Gaussian with constant 
width, the likelihood approach will yield exact coverage, but in general this is not so. 
For example, Garwood[31] and Heinrich[32] have investigated the properties of the 
likelihood approach (and other methods too) to estimate jz, the mean of a Poisson, 
when “ops events are observed. Because ops is a discrete variable, the coverage is 


If there are more than just one parameter, the likelihood must of course be remaximised with 
respect to all the other parameters when looking for the A(/nL) = 0.5 points. Alternatively, a 
region in multi-parameter space can be selected by finding the contour at which A (In L) decreases 
from its maximum by an amount which depends on the number of parameters. 

"This (like several other methods) can give rise to asymmetric uncertainties. Techniques for dealing 
with this have been discussed by Barlow[30]. 
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a discontinuous function of u, and varies from 100% at u = 0 down to 30% at 
u 40.5? 


15.2.2 Unbinned Maximum Likelihood and Goodness of Fit 


With sparse data, the unbinned likelihood method is a good one for estimating 
parameters of a model. In order to understand whether these estimates of the 
parameters are meaningful, we need to know whether the model provides an 
adequate description of the data. Unfortunately, as emphasised by Heinrich[33], 
the magnitude of the unbinned maximum likelihood is often independent of 
whether or not the data agree with the model. He illustrates this by the example 
of the determination of the lifetime t of a particle whose decay distribution is 
(1/1) exp(—t/t). For a set of observed times t;, the maximum likelihood Lmax 
depends on the data t; only through their average value t. Thus any data distributions 
with the same f would give identical Lmax, which demonstrates that, at least in this 
case, Lmax gives no discrimination about whether the data are consistent with the 
expected distribution. 

Another example is fitting an expected distribution (1 + œ cos? @)/(1 + a/3) 
to data 6; on the decay angle of some particle, to determine o. According to 
the expected functional form, the data should be symmetrically distributed about 
cos@ = 0. However, the likelihood depends only on the square of cos0, and 
so would be insensitive to all the data having cos 6; negative; this would be very 
inconsistent with the expected symmetric distribution. 

In contrast Baker and Cousins[34] provide a likelihood method of measuring 
goodness of fit for a data histogram compared to a theory. The Poisson likelihood 
Ppois(n|) for each bin is compared with that for the best possible predicted value 
[best = n for that bin. Thus the Baker-Cousins likelihood ratio 


e "i p; fni! 


LRgc = Ua, = Hen Du; ni) (15.3) 
e "in fni! 


is such that asymptotically —2/n L R gc is distributed as x?.? For small u, the Baker- 
Cousins likelihood ratio is better than a weighted sum of squares for assessing 
goodness of fit. 


8]t is of course not surprising that methods that are expected to have good asymptotic behaviour 
may not display optimal properties for u ^: 0. 
°The binned Poisson likelihood is not a measure of fit. This is because, for example. ui = nj = 


1 and u; = nj; = 100 both correspond to perfect agreement between data and prediction, but 
Ppois(1|1.0) is much larger than Pp,j,(100|100.0). 
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15.2.3 Profile Likelihood 


In many situations the likelihood is a function not only of the parameter of interest $ 
but also other parameters. These may be other physics parameters (for example, in 
neutrino oscillation experiments where the mixing angles and differences in mass- 
squared of the various neutrinos are relevant), but can also be nuisance parameters 
v associated with systematic effects (e.g. jet energy scales, particle identification 
efficiencies, etc.). To make statements about $, the likelihood L(@.v) is often 
‘profiled’ over the nuisance parameters, i.e. at each value of 4, the likelihood is 
remaximised with respect to v. Thus 


Lprof ($) = L(@, Vmax ($)) (15.4) 


Then L prof ($) is used much as the ordinary likelihood when there are no nuisance 
parameters. 

A profile likelihood is in general wider than the likelihood for a fixed value of 
the nuisance parameter v; this results in the uncertainty in the parameter of interest 
¢ being larger when allowance is made for the systematic uncertainties. 

In the standard profile likelihood, v is a continuous variable. An extension of this 
has been used by Dauncey et al. [38], to allow for uncertainties in the choice of 
functional form of the background parametrisation in searches for new particles as 
peaks above background in a mass spectrum. Here the systematic is discrete, rather 
than continuous. 

An alternative way of eliminating nuisance parameters (known as marginalisa- 
tion) is to use L(¢, v) as part of a Bayesian procedure, and than to integrate the 
Bayesian posterior over v. i.e. 


Pmarg() = f Pposr($, v) dv (15.5) 


Of course, both profiling and marginalisation result in the loss of information. 
Reference [37] provides a very trivial example of this for profile likelihoods. 


15.2.4 Punzi Effect 


Sometimes we have two or more nearby peaks, and we try to fit our data in order 
to determine the fractions of each peak. Punzi [39] has pointed out that it is very 
easy to write down a plausible but incorrect likelihood function that gives a biassed 
result. This occurs in situations where the events have experimental resolutions o 
in the observable x that vary event-by-event; and the distributions of o are different 
for the two peaks. 
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For a set of observations x;, it is tempting but wrong to write the unbinned 
likelihood as 


L(f)wrong = IHS * G(xi, 0.0, oi) + (1 — f) * Gxi, 1.0, o1)} (15.6) 


where f is the fraction of the first peak (labelled A below) which is parametrised 
as G(x;, 0.0, oj), a Gaussian in x;, centred on zero, and with width o;, and i is the 
label for the ith event; and similarly for the second peak (labelled B), except that it 
is centred at unity. 

Application of the rules of conditional probability shows that the correct 
likelihood is 


L(f right = Uf * Gj, 0.0, o;i) * p(o;| A) + (1 — f£) * G(x, 1.0, oi) x p(ci|B)] 
(15.7) 


where p(o;|A) and p(o;|B) are the probability densities for the resolution being 
oj for the A and B peaks respectively. We then see that L(f) wrong and L(f )rign: 
give identical values for f, provided that p(o;|A) = p(o;|B). If however, the 
distributions of the resolution differ, L(f)wrong will in general give a biassed 
estimate. 

Punzi investigated the extent of this bias in a simple Monte Carlo simulation, and 
it turns out to be surprisingly large. For example, with f = 1/3, and p(o4) and 
p(og) being ó—functions at 1.0 and at 2.0 respectively (i.e. o = 1 for all A events, 
and o = 2 for all B events), the fitted value of f from L(f)wrong turned out to be 
0.65. Given that f is confined to the range from zero to unity, this is an enormous 
bias. 

The way the bias arises can be understood as follows: The fraction f of the 
events that are really A have relatively good resolution, and so the fit to them alone 
would assign essentially all of them as belonging to A i.e. these events alone would 
give f ~ 1 with a small uncertainty. In contrast the 1 — f of the events that are 
B have poor resolution, so for them the fit does not mind too much what is the 
value of f. But the fit uses all the events together, and so assigns a single f to the 
complete sample; this will be a weighted average of the f values for the A and for 
the B events. Because the A events result in a more accurate determination of f 
than do the B events, the fitted f will be biassed upwards (i.e. it will over-estimate 
the fraction of events corresponding to the peak with the better resolution). 

The Punzi effect can also appear in other situations, such as particle identifica- 
tion. Different particle types (e.g. pions and kaons) would appear as different peaks 
in the relevant particle-identification variable e.g. time of flight, rate of energy loss 
d E/dx, angle of Cherenkov radiation, etc. The separation of these peaks for the 
different particle types depends on the momentum of the particles (see Fig. 15.2). 
The incorrect L is now 


LwrongCfk) = IHA — fr) * GGi, xa (pi), 0i) + fe *G (xi, xk(pi),o))) — (105.8) 
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(a) Low momentum (b) High momentum 


TOF — TOF — 


Fig. 15.2 The Punzi effect in particle identification. The diagrams show the expected (normalised) 
distributions of the output signal from a particle identifier, for pions and for kaons (a) at low 
momentum where separation is easier, and (b) at high momentum where the distributions overlap. 
Because kaons are heavier than pions, they tend to have larger momenta. Because it is hard at high 
momentum to distinguish pions from kaons, the likelihood function is insensitive to whether these 
tracks are classified as pions or kaons, and hence the fraction of high momentum tracks classified as 
kaons will have a large uncertainty. In contrast, low momentum tracks will be correctly identified. 
Thus if the plausible but incorrect likelihood function that ignores the pion and kaon momentum 
distributions is used to determine the overall fraction of kaons, it will be biassed downwards 
towards the fraction of low momentum particles that are kaons 


where x4 (pj) and xx (pi) are the expected positions of the particle identification 
information for a particle of momentum pj, and x; is the observed value for the 
ith event. So here the Punzi bias can arise even with constant resolution, because 
the momentum spectra of pions and kaons can be different. To avoid the bias, the 
likelihood needs to incorporate information on the different momentum distributions 
of pions and of kaons. If these momentum distributions are different enough from 
each other, it could be that the likelihood function bases its separation of the different 
particle types on the momenta of the particles rather than on the data from the 
detector’s particle identifier. Catastini and Punzi[40] avoid this by using parametric 
forms for the momentum distributions of the particles, with the parameters being 
determined by the data being analysed. 

The common feature potentially leading to bias in these two examples is that the 
ratio of peak separation to resolution is different for the two types of objects. For 
the first example of separating the two peaks, it was the denominators that were 
different, while in the particle identification problem it was the numerators. 

The Punzi bias may thus occur in situations where the templates in a multi- 
component fit depend on additional observations whose distributions are not 
explicitly included in the likelihood. 
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15.3 Separating Signal from Background 


Almost every Particle Physics analysis uses some technique for separating possible 
signal from background. First some simple 'cuts' are applied; these are generally 
loose selections on single variables, which are designed to remove a large fraction 
of the background while barely reducing the real or potential signal. Then to 
obtain a better separation of signal from background in the multi-dimensional 
space of the event observables, methods like Fisher discriminants, decision trees, 
artificial neural networks (including Bayesian nets and more recently deep neural 
nets), support vector machines, etc. are used[41, 42]. Extensions of these methods 
involve bagging, boosting and random forests, which have been used to achieve 
improved performance of the separation as seen on a plot of signal efficiency 
against background mis-acceptance rate. A description of the software available for 
implementing some of these techniques can be found in the talks by Narsky [43] and 
by Tegenfeldt[44] at the PHYSTAT-LHC Workshop. 

More recently, deep learning techniques are rapidly becoming popular. In Particle 
Physics, they have been used for on-line triggering, tracking, fast simulation, object 
identification, image recognition, and event-by-event separation of signal from 
background. Reference [45] provides good introductions to the use of these methods 
for Particle Physics. There are now regular workshops and lectures on Machine 
Learning at CERN and at Fermilab (see refs. [46] and [47]), as well as at many 
universities. 

The signal-to-background ratio before this multivariate stage can vary widely, as 
can the signal purity after it. If some large statistics study is being performed (e.g. 
to use a large sample of events to obtain an accurate measurement of the lifetime of 
some particle), then it is not a disaster if there is some level of background in the 
finally selected events, provided that it can be accurately assessed and allowed for in 
the subsequent analysis. At the other extreme, the separation technique may be used 
to see if there is any evidence for the existence of some hypothesised particle (the 
potential signal), in the presence of background from well-known sources. Then the 
actual data may in fact contain no observable signal. 

These techniques are usually ‘taught’ to recognise signal and background by 
being given examples consisting of large numbers of events of each type. These 
may be produced by Monte Carlo simulation, but then there is a problem of trying 
to verify that the simulation is a sufficiently accurate representation of reality. It is 
better to use real data for this, but the difficulty then is to obtain sufficiently pure 
samples of background and signal. Indeed, for the search for a new particle, true 
data examples do not exist. However, it is the accurate representation of background 
that is likely to pose a more serious problem. 

The way that, for example, neural networks are trained is to present the software 
with approximately equal numbers of signal and background events! and then 


!0For searches for rare processes, it is clearly inappropriate to use the actual fractions expected 
in the data to determine the ratio of signal to background Monte Carlo events to be used as the 
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to minimise a cost function C for the network. This is usually defined as C = 
X(zi— ti)”, where z; is the trained network's output for the ith event; t; is the target 
output, usually chosen as 1 for signal and zero for background; and the summation 
is over all testing events presented to the network. The problem with this is that 
C is only loosely related to what we really want to optimise. For a search for 
a new particle this could be the sensitivity of the experimental upper limit in the 
absence of signal, while for a high statistics analysis measuring the properties (such 
as mass or lifetime) of some well-established particle, we would be interested in 
minimising the uncertainty (including systematic effects) on the result, without the 
training procedure biassing the measurement. 

As with all event separation methods. it is essential to check the performance 
of a trained procedure by using a set of events that are independent of those 
used for training. This is to ensure that the network does not use specific but 
irrelevant features of the training events in its learning process, but can achieve 
good performance on unseen data. 

Some open questions are: 


* How can we check that our multi-dimensional training samples for signal and 
background are reliable descriptions of reality; and that they cover the region of 
multi-dimensional space populated by the data? 

* How should the ratio of the numbers of signal and background training events be 
chosen, especially when there are several different sources of background? 

* What is the best way of allowing for nuisance parameters in the models of the 
signal and/or background?[25, 48] 

* Are there useful and easy ways of optimising on what is really of interest?[49] 


15.3.1 Understanding How Neural Networks Operate 


Itis useful to appreciate how neural networks operate in providing a good separation 
of signal and background, as this can help in choosing a suitable architecture for the 
network. 

Figure 15.3a shows some hypothetical signal and background events in terms of 
two measured variables x and y for each event. A network with two inputs (x and y), 
a single hidden layer with 3 nodes, and a single output is used; it aims to give 1 for 
signal and zero for background events (see Fig. 15.3b. This is achieved by training 
the network with (x, y) values for known examples of signal and background; and 
allowing the network to vary its internal parameters to minimise a suitably defined 
cost function e.g. X (Ze — te)’, where the summation is over the training events, and 
Ze and f, are the network’s output and its target value (0 or 1) respectively. 


training sample, because the network could then achieve a very small cost C simply by classifying 
everything as background. 
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Fig. 15.3 (a) A2 — D plot showing the regions of the variables v; and v» for the signal (dots)and 
background (triangles). (b) The neural network used for separating signal and background. (c) The 
top hidden node receives inputs from v, and v2. With suitable weights and threshold and a large 
value of £, the node's output will be on for (v1, v2) values below the diagonal line. (d) Similarly 
for the other two hidden nodes, their outputs can be on for (v1, v2) below the other diagonal line, 
and above the horizontal one, respectively. A further choice of weights to the output node and its 
threshold can ensure that the whole network's output will be on only if all three hidden nodes' 
outputs are on, i.e. if (vi, v2) values are within the triangle in (d) 


The input q; to a given hidden node i is a linear combination of the input variables 
x and y 


qi = WyxiX + Wyiy + fi (15.9) 


where the weights w and threshold f are varied during the fitting process. The output 
r from any hidden node is determined from its input q by something like a sigmoid 
function e.g. 


r=1/(+e 80 (15.10) 
This switches from zero for large negative q to unity for large positive q. The switch 


occurs around q = 0, and width of the region depends on the network parameter f. 
For very large f, there is a rapid switch from zero to unity. In terms of x and y, this 
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means that the hidden node i is ‘on’ (i.e. rj = 1) if 
WyiX + Wyiy + ti > 0, (15.11) 


or ‘off’ otherwise. Thus the boundary between events having r; on or off is a straight 
line in the (x, y) plane (see Fig. 15.3c). With suitable values for the weights and 
thresholds for the three hidden nodes, there will be three straight line boundaries 
in the (x, y) plane shown in Fig. 15.3d. Finally, to produce the “and” of these three 
conditions, the weights wj, (from the hidden node j to the output node o) and the 
output threshold t; can be set as 


Wio = W29 = Wig — 0.4 tp — —1.0 (15.12) 


to ensure that the output will be “on” only if the three hidden layers are all “on”, 
i.e. that the selected input values are inside the triangular region in the (x.y) plane. 
With £ set at a lower level, the contour for the selected region will be smoother with 
rounded corners, rather than being triangular. 

It would be useful to have a similar understanding of how deep networks operate. 
Tishby[50] has provided some insight on what happens in the hidden layers of a deep 
neural network during the training procedure. 


15.4 Parameter Determination 


For a single parameter (e.g. the branching ratio for H — jt i) the parameter 
range could be either a 2-sided interval or just an upper limit, at some confidence 
level (typically 68% for 2-sided intervals, but usually 90% and 95% for upper 
limits). For two parameters (e.g. mass and production rate for some new particle 
X that decays to a top pair), their acceptable values could be those inside some 2- 
dimensional confidence region. Alternatively an upper limit or 2-sided region for 
one parameter as a function of the other could be defined; these are known as a 
Raster Scan. 
An upper limit on 2-variables is not a well-defined concept. 


15.4.1 Upper Limits 


Most recent searches for new phenomena have not found any evidence for exciting 
new physics. Examples from particle physics include searches for SUSY particles, 
dark matter, etc.; attempts to find substructure of quarks or leptons; looking for extra 
spatial dimensions; measuring the mass of the lightest neutrino; etc. Rather than 
just saying that nothing was found, it is more useful to quote an upper limit on the 
sought-for effect, as this could be useful in ruling out some theories. For example in 
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1887, Michelson and Morley[52] attempted to measure the speed of the Earth with 
respect to the aether. No effect was seen, but the experiment was sensitive enough 
to lead to the demise of the aether theory. 

A simple scenario is a counting experiment where a background b is expected 
from conventional sources, together with the possibility of an interesting signal 
s. The number of counts n observed is expected to be Poisson distributed with a 
mean u = es + b, where b is the expected number of events from background, 
and e is a factor for converting the basic physics parameter s into the number of 
signal events expected in our particular experiment; it thus allows for experimental 
inefficiency, the experiment’s running time; etc. Then given a value of n which is 
comparable to the expected background, what can we say about s? The true value of 
the parameter s is constrained to be non-negative. The problem is interesting enough 
if b and e are known exactly; it becomes more complicated when only estimates with 
uncertainties op and o. are available. 

An extension of the simple counting scenario is when a search for a new particle 
is carried out over a range of masses. This is usually dealt with by performing 
separate searches at a series of masses over a specified range. This ‘Raster Scan’ 
is in contrast with a method that regards the sought-for new particle's mass and 
its production rate as two parameters to be estimated simultaneously. The relative 
merits of these two approaches are described in ref. [51]. 

Even without the nuisance parameters, a variety of methods is available. These 
include likelihood, x?, Bayesian with various priors for s, frequentist Neyman 
constructions with a variety of ordering rules for n, and various ad hoc approaches. 
The methods give different upper limits for the same data.!! A comparison of several 
methods can be found in ref. [53]. The largest discrepancies arise when the observed 
n is less than the expected background b, presumably because of a downward 
statistical fluctuation. The following different behaviours of the limit (when n « b) 
can be obtained: 


* Frequentist methods can give empty intervals for s i.e. there are no values of s for 
which the data are likely. Particle physicists tend to be unhappy when their years 
of work result in an empty interval for the parameter of interest, and it is little 
consolation to hear that frequentist statisticians are satisfied with this feature, as 
it does not necessarily lead to undercoverage. 

When n is not quite small enough to result in an empty interval, the upper 
limit might be very small.!? This could confuse people into thinking that the 
experiment was much more sensitive than it really was. 

* The Feldman-Cousins frequentist method[54] (see Sect. 15.4.3) that employs a 
likelihood-ratio ordering rule gives upper limits which decrease as n gets smaller 


!! By coincidence, the upper limits obtained by the Bayesian approach with an (improper) flat prior 
for s and by the appropriate Neyman construction agree when b = 0. 

P Bayesian methods that use priors with part of the probability density being a 5-function at s = 0 
can result is a posterior with an enhanced 5-function at zero, such that the upper limit contains only 
the single point s = 0. 
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at constant b. A related effect is the growth of the limit as b decreases at constant 
n—this can also occur in other frequentist approaches. Thus if no events are 
observed (n = 0), the upper limit of a 90% Feldman-Cousins interval is 1.08 
for b = 3.0, but 2.44 for b = 0. This is sometimes presented as a paradox, in 
that if a bright graduate student worked hard and discovered how to eliminate the 
expected background without much reduction in signal efficiency, the ‘reward’ 
would be a weaker upper limit.? An answer is that although the actual limit 
had increased, the sensitivity of the experiment with the smaller background was 
better. There are other situations—for example, variants of the random choice of 
voltmeter (compare ref. [55])—where a measurement with better sensitivity can 
on occasion give a less precise result. 

In the Bayesian approach, the dependence of the limit on b is weaker. Indeed 
when n = 0, the limit does not depend on b. 

Sen et al. [56] consider a related problem, of a physical non-negative parameter 
à producing a measurement x, which is distributed about X as a Gaussian of 
variance o?. As the observable x becomes more and more negative, the upper 
limit on A increases, because it is deduced that o must in fact be larger than its 
quoted value. 


In trying to assess which of the methods is best, one first needs a list of desirable 


properties. These include: 


Coverage: Even though coverage is a frequentist concept, most Bayesian particle 
physicists would like the coverage of their intervals to match their reported 
credibility, at least approximately. 

Because the data in counting experiments is discrete, it is impossible in any 
sensible way to achieve exact coverage for all jz (see Sect. 15.1.3.4). However, it 
is not completely obvious that even Frequentists need coverage for every possible 
value of u, since different experiments will have different values of b and of e. 
Thus even for a constant value of the physical parameter s, different experiments 
will have different u = € * s + b. Thus it would appear that, if coverage in some 
average (over u) sense were satisfactory, the frequentist requirement for intervals 
to contain the true value at the requisite rate would be maintained. This, however, 
is not the generally accepted view by particle physicists, who would like not to 
undercover for any ju. 

Not too much overcoverage: Because coverage varies with jz, for methods that 
aim not to undercover anywhere, some overcoverage is inevitable. This corre- 
sponds to having some upper limits which are high, and this leads to undesirable 
loss of power in rejecting alternative hypotheses about the parameter's value. 


The n = 0 situation is perhaps a special case, as the number of observed events cannot decrease 
as further selections are imposed to reduce the expected background. For non-zero observed events, 
if n decreases with the tighter cuts (as expected for reduced background), the upper limit is likely 
to go down, in agreement with intuition. But if stays constant, that could be because the observed 
events contain signal, so it is perhaps not surprising that the upper limit increases. 
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* Short and empty intervals: These can be obtained for certain values of the 
observable, without resulting in undercoverage. They are generally regarded as 
undesirable for the reasons explained above. 


It is not obvious how to incorporate the above desiderata on interval length into an 
algorithm that would be useful for choosing among different methods for setting 
limits. For different experiments studying the same phenomena (e.g. Dark Matter 
searches, neutrino oscillation experiments, etc.) it is worthwhile to use the same 
technique for calculating allowed parameter ranges. 


15.4.2 Two-Sided Intervals 


An alternative to giving upper limits is to quote two-sided intervals. For example, a 
68% confidence interval for the mass of the top quark might be 172.6-173.4 GeV/c’, 
as opposed to its 95% upper limit being 173.6 GeV/c?. Most of the difficulties 
and ambiguities mentioned above apply in this case too, together with some extra 
possibilities. Thus, while it is clear which of two possible upper limits is tighter, this 
is not necessarily so for two-sided intervals, where which is shorter may be metric 
dependent; the first of two intervals for a particle’s lifetime t may be shorter, but the 
second may be shorter when the ranges are quoted for its decay rate (= 1/t). There 
is also scope for choice of ordering rule for the frequentist Neyman construction, or 
for choosing the interval from the Bayesian posterior probability density. !4 


15.4.3 Feldman-Cousins Approach 


Feldman and Cousins' fully frequentist approach[54] exploits the freedom available 
in the Neyman construction of how to choose an interval in the data that contains 
a given fraction a of the probability, by using their ‘ordering rule’. This is based 
on the likelihood ratio L(x, “)/L(x, Ubest), Where Apes; is the physically-allowed 
value of u which gives the largest value of L for that particular x. For values of u 
far from a physical boundary, this makes little difference from the standard central 
Neyman construction, but near a boundary the region is altered in such a way as to 
make it unlikely that there will be zero-length or empty intervals for the parameter 
I^; these can occur in the standard Neyman construction (see Fig. 15.4). 


1^ A Bayesian statistician would be happy with the posterior as the final result. Particle physicists 
like to quote an interval as a convenient summary. For a parameter that cannot be negative and 
for which the exclusion of zero is interesting (e.g. testing whether the production rate of some 
hypothesised particle is non-zero), an upper limit would always include zero, a lower limit or a 
central interval would exclude it and a maximum probability density one would not be invariant 
with respect to changes in the functional form of the parameter. 
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Fig. 15.4 The Feldman-Cousins 90% confidence band (solid curves) for the mean jz of a Gaussian 
probability density function of unit variance for a measurement x. The straight dashed lines show 
the confidence band for the central Neyman construction. The Feldman-Cousins ordering rule pulls 
the interval to the left at small jz, and hence, even for negative observed x, the u interval is not 
empty, as happens for central frequentist intervals when x is below —1.6 


The original Feldman-Cousins paper also considered how to extend their method 
when there is more than one parameter and one measurement. They describe an 
idealised neutrino oscillation experiment with the data being the energy spectrum of 
the interacting neutrinos, and the parameters are sin? (20) and Am? (see Eq. 15.15). 
A practical problem of having many parameters is the CPU time required to compute 
the results. 

Feldman and Cousins also point out that an apparently innocuous procedure for 
choosing what result to quote may lead to undercoverage. Many physicists would 
quote an upper limit on any possible signal if their observation was less than 3 
standard deviations above the expected background, but a two-sided interval if 
their result was above this. With each type of interval constructed to give 90% 
coverage, there are some values of the parameter for which the coverage for this 
mixed procedure drops to 85%; Feldman and Cousins refer to this as ‘flip-flop’. 
Their *unified' approach circumvents this problem, as it automatically yields upper 
limits for small values of the data, but two-sided intervals for larger measurements, 
while avoiding undercoverage for all possible true values of the signal. 
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15.4.4 Sensitivity 


It is useful to quote the sensitivity of a procedure, as well as the actual upper 
limit as derived from the observed data.'? For upper limits or for uncertainties on 
measurements, this can be defined as the median value that would be obtained if the 
procedure was repeated a large number of times.!© Using the median is preferable 
to the mean because (a) it is metric independent (i.e. the median lifetime upper limit 
would be the reciprocal of the median decay rate lower limit); and (b) it is much less 
sensitive to a few anomalously large upper limits or uncertainty estimates. 

It is common to present not only the median of the expected distribution, but also 
values corresponding to 16th and 84th percentiles (commonly referred to as +1 o) 
and also the 2.5% and 97.5% ones (2-26). This enables a check to be made that the 
observed result is reasonable. 

Punzi [57] has drawn attention to the fact that this choice of definition for 
sensitivity has some undesirable features. Thus designing an analysis procedure to 
minimise the median upper limit for a search in the absence of a signal provides a 
different optimisation from maximising the median number of standard deviations 
for the significance of a discovery when the signal is present. Also there is only 
a 50% chance of achieving the median result or better. Instead, for pre-defined 
levels o and confidence level CL, Punzi determines at what signal strength there 
is a probability of at least CL for establishing a discovery at a significance level o. 
This is what he quotes as the sensitivity, and is the signal strength at which we are 
sure to be able either to claim a discovery or to exclude its existence. Below this, 
the presence or otherwise of a signal makes too little difference, and we may remain 
uncertain (see Fig. 15.5). 


15.4.5 Nuisance Parameters 


For calculating upper limits in the simple counting experiment described in 
Sect. 15.4.1, the nuisance parameters arise from the uncertainties in the background 
rate b and the acceptance e. These uncertainties are usually quoted as op and oe 
(e.g. b = 3.1 + 0.5), and the question arises of what these uncertainties mean. 
Sometimes they encapsulate the results of a subsidiary measurement, performed to 
estimate b or e, and then they would express the width of the Bayesian posterior 
or of the frequentist interval obtained for the nuisance parameters. However, in 


lThe sensitivity on its own will not do, because it is independent of the data. 

lóInstead of using a large number of simulations in order to extract the median, sometimes the 
‘Asimov’ data set is used. This is the single data set that would be obtained if statistical fluctuations 
were suppressed. i.e. if a model predicted 11.3 events in a particular bin, the Asimov data set for 
that model would contain 11.3 events in that bin. The Asimov data set and the median of the toys 
usually but not always produce similar results. 
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Fig. 15.5 Punzi definition of sensitivity. Expected distributions for a statistic f (which in simple 
cases could be simply the observed number of events n), for Ho = background only (solid curves) 
and for Hı = background plus signal (dashed curves). In (a), the signal strength is very weak, 
and it is impossible to choose between Hp and Hj. As shown in (b), which is for moderate signal 
strength, po is the probability according to Ho of t being equal to or larger than the observed fy. To 
claim a discovery, po should be smaller than some pre-set level o, usually taken to correspond to 
50; terit is the minimum value of t for this to be so. Similarly p; is the probability according to Hj 
for t < to. The power function is the probability according to the alternative hypothesis that ¢ will 
exceed terit. As the separation of the Hg and Hı pdfs increases, so does the power. According 
to Punzi, the sensitivity should be defined as the expected production strength of the signal such 
that the power exceeds another predefined CL, e.g. 95%. The exclusion region corresponds to fo in 
the 596 lower tail of Hı, while the discovery region has fo in the So upper tail of Ho; in (b) there 
is a "No decision" region in between, as the signal strength is below the sensitivity value. The 
sensitivity is thus the signal strength above which there is a 9596 chance of making a 5o discovery. 
i.e. The distributions for Ho and Hi are sufficiently separated that, apart possibly for the 5o upper 
tail of Ho and the 5% lower tail of Hi, they do not overlap. In (c) the signal strength is so large that 
there is no ambiguity in choosing between the hypotheses 


many situations, the uncertainties may involve Monte Carlo simulations, which have 
systematic uncertainties (e.g. related to how well the simulation describes the real 
data) as well as statistical ones; or they may reflect uncertainties or ambiguities 
in theoretical calculations required to derive b and/or e. In the absence of further 
information the posterior is often assumed to be a Gaussian, usually truncated so as 
to exclude unphysical (e.g. negative) values. This may be at best only approximately 
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true, and deviations are likely to be most serious in the tails of the distribution. A 
log-normal or gamma function may be a better choice. 


There are many methods for incorporating nuisance parameters in upper limit 


calculations. These include: 


Profile likelihood (see also Sect. 15.2.3) 

The likelihood, based on the data from the main and from the subsidiary 
measurements, is a function of the parameter of interest s and of the nui- 
sance parameters. The profile likelihood Lprof (s) is simply the full likelihood 
L(s, Dbest (S), Ebest (5)), evaluated at the values of the nuisance parameters that 
maximise the likelihood at each s. Then the profile likelihood is simply used to 
extract the limits on s, much as the ordinary likelihood could be used for the case 
when there are no nuisance parameters. 

Rolke et al. [59] have studied the behaviour of the profile likelihood method 
for limits. Heinrich[32] had shown that the likelihood approach for estimating 
a Poisson parameter (in the absence of both background and of nuisance 
parameters) can have poor coverage at low values of the Poisson parameter. 
However, the profile likelihood seems to do better, probably because the nuisance 
parameters have the effect of smoothing away the fluctuating coverage observed 
by Heinrich. 

Fully Bayesian 

When there is a subsidiary measurement for a nuisance parameter, a prior 
is chosen for b (or €), the data are used to extract the likelihood, and then 
Bayes’ Theorem is used to deduce the posterior for the nuisance parameter. 
This posterior from the subsidiary measurement is then used as the prior for the 
nuisance parameter in the main measurement (this prior could alternatively come 
from information other than a subsidiary measurement); with the prior for s and 
the likelihood for the main measurement, the overall joint posterior for s and 
the nuisance parameter(s) is derived.! This is then integrated over the nuisance 
parameter(s) to determine the posterior for s, from which an upper limit can be 
derived; this procedure is known as marginalisation. 

Numerical examples of upper limits can be found in ref. [60], where a method 
is discussed in detail. Thus assuming (somewhat unrealistically) precisely deter- 
mined backgrounds, the effect of a 10% uncertainty in € can be seen for various 
measured values of n in Table 15.1. A plot of the coverage when the uncertainty 
in € is 20% is reproduced in Fig. 15.6. 

It is not universally appreciated that the choice for the main measurement of a 
truncated Gaussian prior for e and an (improper) constant prior for non-negative 
s results in a posterior for s which diverges[61]. Thus numerical estimates of the 
relevant integrals are meaningless. Another problem comes from the difficulty 
of choosing sensible multi-dimensional priors. Heinrich has pointed out the 


l7This is usually equivalent to starting with a prior for s and the nuisance parameters, and the 
likelihood for the data from the main and the subsidiary experiments together, to obtain the joint 
posterior. 
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Table 15.1 Bayesian 90% confidence level upper limits for the production rate s as a function 
of n, the observed number of events 


n |b =0.0 b = 3.0 

0 | 2.35 (2.30) 2.35 (2.30) 

3 | 6.87 (6.68) 4.46 (4.36) 

6 | 10.88 (10.53) | 7.80 (7.60) 
9 — [1471 (14:20) —. |1L56 (11.21) 
20 | 28.27 (27.05) 25.05 (24.05) 


The Poisson parameter y = € * s + b, where the expected background b is either 0.0 or 3.0, 
and is precisely known; and e, whose true values is 1.0, is estimated in a subsidiary measurement 
with 10% accuracy. The numbers in brackets are the corresponding upper limits when € is known 
precisely. At large n, the limits for b = 3.0 are 3 units lower than those for b = 0.0; the latter are 
approximately n + 1.2847 at large n. The effect of the uncertainty in € is to increase the limits, 
and by a larger amount at large n. For n — 0, these Bayesian limits are independent of the expected 
background b 
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Fig. 15.6 The coverage C for the 90% confidence level upper limit as a function of the true 
parameter 5;;,e, as obtained in a Bayesian approach. The background b = 3.0 is assumed to be 
known exactly, while the subsidiary measurement for € gives a 20% accuracy. The discontinuities 
are a result of the discrete (integer) nature of the measurements. There is no undercoverage 


problems that can arise for the above Poisson counting experiment, when it is 
extended to deal with several data channels simultaneously[62]. 
* Fully frequentist 

In principle, the fully frequentist approach to setting limits when provided 
with data from the main and from subsidiary measurements is straightforward: 
the Neyman construction is performed in the multidimensional space where 
the parameters are s and the nuisance parameters, and the data are from all 
the relevant measurements. Then the region in parameter space for which the 
observed data was likely is projected onto the s-axis, to obtain the confidence 
region for s. 
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In practice there are formidable difficulties in writing a program to do this in 
a reasonable amount of time. Another problem is that, unless a clever ordering 
rule is used for producing the acceptance region in data space for fixed values of 
the parameters, the projection phase leads to overcoverage, which can become 
larger as the number of nuisance parameters increases. Good ordering rules have 
been found for a version of the Poisson counting experiment[63], and also for 
the ratio of Poisson means[64], where the confidence intervals are tighter than 
those obtained by conditioning on the sum of the numbers of counts in the two 
observations. 
For the fully frequentist method, it is guaranteed that there will be no under- 
coverage for any combination of parameter true values. This is not so for any 
other method, and so most particle physicists would like assurance that the 
technique used does indeed provide reasonable coverage, at least for s. There is 
usually lively debate between frequentists and Bayesians as to whether coverage 
is desirable for all values of the nuisance parameter(s), or whether one should 
be happy with no or little undercoverage when experiments are averaged, for 
example, over the nuisance parameter true values. 

* Mixed 
Because of the difficulty of performing a fully frequentist analysis in all but the 
simplest problems, an alternative approach[65] is to use Bayesian averaging over 
the nuisance parameters, but then to employ a frequentist approach for s. The 
hope is that for most experiments setting upper limits, the statistical uncertainties 
on the low n data are relatively large and so, provided the uncertainties in the 
nuisance parameters are not too large, the effect of the systematics on the upper 
limits will not be too dramatic, and an approximate method of dealing with them 
may be reasonable. 
Although such an approach cannot be justified from fundamentals, it provides a 
practical method whose properties can be checked, and is often satisfactory. 


15.4.6 Banff Challenges 


Given the large number of techniques available for extracting upper limits from 
data, especially in the presence of nuisance parameters, it was decided at the 
Banff meeting[6] that it would be useful to compare the properties of the different 
approaches under comparable conditions. This led to the setting up of the “Banff 
Challenge’, which consisted of providing common data sets for anyone to calculate 
their upper limits. This was organised by Joel Heinrich, who reported on the 
performance of the various methods at the PHYSTAT-LHC meeting[66]. 
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At the second Banff meeting[8], the challenge was set by Tom Junk and consisted 
of participants trying to distinguish between histograms, some of which contained 
only background and others which contained a background and signal, which 
appeared as a peak (compare Sect. 15.7.5) 


15.4.7 Recommendations 


It would be incorrect to say that there is one method that must be used. Many Particle 
Physicists’ ideal would be to use a frequentist approach if viable software were 
available for problems with several parameters and items of data. Otherwise they 
would be prepared to settle for a Bayesian approach, with studies of the sensitivity of 
the upper limit to the choice of priors, and of the coverage; or for a profile likelihood 
method, again with coverage studies. What is important is that the procedure should 
be fully defined before the data are analysed; and that when the experimental result 
and the sensitivity of the search are reported, the method used should be fully 
explained. 

The CDF Statistics Committee [67] also suggests that it is useful to use 
a technique that has been employed by other experiments studying the same 
phenomenon; this makes for easier comparison. They tend to favour a Bayesian 
approach, chiefly because of the ease of incorporating nuisance parameters. 


15.5 Combining Results 


This section deals with the combination of the results from two or more mea- 
surements of a single (or several) parameters of interest. It is not possible to 
combine upper limits (UL). This is because an 84% UL of 1.5 could come from 
a measurement of 1.4 + 0.1, or 0.5 + 1.0; these would give very different results 
when combined with some other measurement. 

The combination of p-values is discussed in Sect. 15.7.9. 


15.5.1 Single Parameter 


An interesting question is whether it is possible to combine two measurements 
of a single quantity, each with uncertainty +10, such that the uncertainty on the 
combined best estimate is +1? The answer can be deduced later. 
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To combine N different uncorrelated measurements a; + o; of the same physical 
quantity a!8 when the measurements are believed to be Gaussian distributed about 
the true value a;,,e, the well-known result is that the best estimate acomb £ Ocomb 1S 
given by 


Acomb = X (ai x wi)/ Xwi, Ocomb = 1 Xwi, (15.13) 
where the weights are defined as w; — 1 js. This is readily derived from 


minimising with respect to a a weighted sum of squared deviations 
S(a) = X (a; — a /o2 (15.14) 


The extension to the case where the individual measurements are correlated 
(as is often the case for analyses using different techniques on the same data) 
is straightforward: S(a) becomes ZX(a; — a) x Hij x (aj — a), where H is the 
inverse covariance matrix for the aj. It provides Best Linear Unbiassed Estimates 
(BLUE)[70]. 

There are, however, practical details that complicate its application. For example, 
in the above formula, the o; are supposed to be the true accuracies of the 
measurements. Often, all that we have available are estimates of their values. 
Problems arise in situations where the uncertainty estimate depends on the measured 
value aj. For example, in counting experiments with Poisson statistics, it is typical 
to set the uncertainty as the square root of the observed number. Then a downward 
fluctuation in the observation results in an overestimated weight, and degmp is 
biassed downwards. If instead the uncertainty is estimated as the square root of 
the expected number a, the combined result is biassed upwards—the increased 
uncertainty reduces S at larger a. A way round this difficulty has been suggested 
by Lyons et al. [71]. Alternatively, for Poisson counting data a likelihood approach 
is preferable to a x?-based method. 

Another problem arises when the individual measurements are very correlated. 
When the correlation coefficient of two uncertainties is larger than o1 /o2 (where o1 
is the smaller uncertainty), acomb lies outside the range of the two measurements. 
As the correlation coefficient tends to +1, the extrapolation becomes larger, and is 
sensitive to the exact values assumed for the elements of the covariance matrix. The 
situation is aggravated by the fact that ocomp tends to zero. This is usually dealt with 
by selecting one of the two analyses, rather than trying to combine them. However, 
if the estimated uncertainty increases with the estimated value, choosing the result 
with the smaller estimated uncertainty can again produce a downward bias. On the 
other hand, using the smaller expected uncertainty can cause us to ignore an analysis 
which had a particularly favourable statistical fluctuation, which produced a result 


I8]t is of course much better to use all the data in a combined analysis, rather than simply to 
combine the results. 
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that was genuinely more precise than expected? How to deal with this situation in 
general is an open question. It has features in common with the problem (inspired by 
ref. [55]) of measuring a voltage by choosing at random a voltmeter from a cupboard 
containing meters of different sensitivities. 

Another example involves combining two measurements of a cross-section 
with small statistical uncertainties, but with large correlated uncertainties from the 
common luminosity. With this luminosity uncertainty included in the covariance 
matrix, BLUE can result in the combined value being outside the range of the 
individual measurements. For this situation, it is preferable to exclude the luminosity 
uncertainty from the covariance matrix, and to apply it to the combined result 
afterwards. 


15.5.2 Two or More Parameters 


An extension of this procedure is for combining N pairs of correlated measurements 
(e.g. the gradient and intercept of a straight line fit to several sets of data, where for 
simplicity it is assumed that any pair is independent of every other pair). For several 
pairs of values (aj, bj) with inverse covariance matrices M;, the best combined 
values (acomp, Bcomp) have as their inverse covariance matrix M = Mj. This 
means that, if the covariance matrix correlation coefficients p; of the different 
measurements are very different from each other, the uncertainty on dcomp can be 
much smaller than that for any single measurement. 

This situation applies for track fitting to hits in a series of groups of tracking 
chambers, where each set of close chambers provides a very poor determination 
of the track; but the combination involves widely spaced chambers and determines 
the track well. Using the profile likelihoods (e.g. for the intercept, profiled over the 
gradient) for combining different measurements loses the correlation information 
and can lead to a very poor combined estimate[37]. The alternative of ignoring the 
correlation information is also strongly discouraged. 

The importance of retaining covariances is relevant for many combinations, e.g. 
for the determination of the amount of Dark Energy in the Universe from various 
cosmological data[73]. 


'9For example, the ALEPH experiment at LEP produced a tighter-than-expected upper limit on 
the mass of v, because they happened to observe t decay configurations which were particularly 
sensitive to the v; mass. 
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15.5.3 Data Consistency 


The standard procedure for combining data pays no attention to whether or not 
the data are consistent. If they are clearly inconsistent, then they should not all 
be combined. When they are somewhat inconsistent, the procedure adopted by the 
Particle Data Group[14] is to increase all the uncertainties by a common factor such 
that the overall x? per degree of freedom equals unity.”° 

The Particle Data Group prescription for expanding uncertainties in the case of 
discrepant data sets has complications when each of the data sets consists of two or 
more parameters[72]. 


15.6 Goodness of Fit 


15.6.1 Sparse Multi-Dimensional Data 


The standard method loved by most scientists uses the weighted sum of squares, 
commonly called x7. This, however, is only applicable to binned data (i.e. in a one 
or more dimensional histogram). Furthermore it loses its attractive feature that its 
distribution is model-independent when there is not enough data, which is likely to 
be so in the multi-dimensional case. 

Although the maximum likelihood method is very useful for parameter determi- 
nation with unbinned data, the value of Lmax usually does not provide a measure 
of goodness of fit (see Sect. 15.2.2). 

An alternative that is used for sparse one-dimensional data is the Kolmogorov- 
Smirnov (KS) approach[68], or one of its variants. However, in the presence of fitted 
parameters, simulation is again required to determine the expected distribution of 
the KS-distance. Also because of the problem of how to order the data, the way to 
use it in multi-dimensional situations is not unique. 

The standard KS method uses the maximum deviation between two cumulative 
distributions; because of statistical fluctuations, this is likely to occur near the 
middle of the distributions. In cases where interesting New Physics is expected to 
occur at extreme values of some kinematic variable (e.g. pr), variants of KS such as 
Anderson-Darling[69] that give extra weight to the distributions’ tails may be more 
useful. 


20This is somewhat conservative, in that even if there are no problems, about half the data sets 
would be expected to have this larger than unity. 
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15.6.2 Number of Degrees of Freedom 


If we construct the weighted sum of squares S between a predicted theoretical curve 
and some data in the form of a histogram, provided the Poisson distribution of the 
bin contents can be approximated by a Gaussian (and the theory is correct, the data 
are unbiased, the uncertainty estimates are correct, etc.), asymptotically?! S will 
be distributed as x? with the number of degrees of freedom v = n — f, where n is 
the number of data points and f is the number of free parameters whose values are 
determined by minimising S. 

The relevance of the asymptotic requirement can be seen by imagining fitting a 
more or less flat distribution by the expression N(1 + 107 cos(x — xo)), where the 
free parameters are the normalisation N and the phase xo. It is clear that, although 
xo is left free in the fit, because of the 1076 factor, it will have a negligible effect 
on the fitted curve, and hence will not result in the typical reduction in S associated 
with having an extra free parameter. Of course, with an enormous amount of data, 
we would have sensitivity to xo, and so asymptotically it does reduce v by one unit, 
but not for smaller amounts of data. 

Another example involves neutrino oscillation experiments[54]. In a simplified 
two neutrino scenario, the neutrino energy spectrum is fitted by a survival probabil- 
ity P of the form 


P = 1 — sin? 20 sin?(C * Am’), (15.15) 


where C is a known function of the neutrino energy and the length of its flight path, 
Am? is the difference in mass squared of the relevant neutrino species, and 8 is the 
neutrino mixing angle. For small values of C x Am?, this reduces to 


P ~ 1 — sin? 20 (C * Am?)* (15.16) 


Thus the survival probability depends on the two parameters only via their product 
sin 20 Am. Because this combination is all that we can hope to determine, we 
effectively have only one free parameter rather than two. Of course, an enormous 
amount of data can manage to distinguish between sin(C * Am?) and C x Am?, and 
so asymptotically we have two free parameters as expected. 


15.7 Discovery Issues 


Searches for new particles are an exciting endeavour, and continue to play a large 
role at the LHC at CERN, in neutrino experiments, in searches for dark matter, etc. 
The 2007 and 2011 PHYSTAT Workshops at CERN[7, 9] were devoted specifically 


?'The examples in this section go beyond the requirement that we need enough events for the 
Poisson distribution to be well approximated by a Gaussian. 
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to statistical issues that arise in discovery-orientated analyses at the LHC. Ref 
[74] deals with statistical issues that occur in Particle Physics searches for new 
phenomena; as an example, it includes the successful search for the Higgs boson 
at the LHC. A more detailed description of the plans for the Higgs search before its 
discovery is in ref. [75]. 


15.7.1 Ho, or Ho Versus Hy? 


In looking for new physics, there are two distinct types of approach. We can compare 
our data just with the null hypothesis Ho, the SM of Particle Physics; alternatively 
we can see whether our data are more consistent with Ho or with an alternative 
hypothesis Hj, some specific manifestation of new physics, such as a particular 
form of quark and/or lepton substructure. The former is known as 'goodness of fit', 
while the term ‘hypothesis testing’ is often reserved for the latter. 

Each of these approaches has its own advantage. By not specifying a specific 
alternative,” the goodness of fit test may be capable of detecting any form of 
deviation from the SM. On the other hand, if we are searching for some specific 
new effect, a comparison of Ho and H is likely to be a more sensitive way for that 
particular alternative. Also, the ‘hypothesis testing’ approach is less likely to give a 
false discovery claim if the assumed form of Ho has been slightly mis-modelled. 


15.7.2 p-Values 


In order to quantify the chance of the observed effect being due to an uninteresting 
statistical fluctuation, some statistic is chosen for the data. The simplest case would 
be the observed number no of interesting events. Then the p-value is calculated, 
which is simply the probability that, given the expected background rate b from 
known sources, the observed value would fluctuate up to ng or larger. In more 
complicated examples involving several relevant observables, the data statistic may 
be a likelihood ratio Lo/L for the likelihood of the null hypothesis Ho compared 
with that for a specific alternative Hj. 

To compute the p-value of the observed or of possible data, the distribution f (t) 
of the data statistic f under the relevant hypothesis is required. In some cases this 
can be obtained analytically, but in more complicated situations, f(t) may require 
simulation. For t being —2 In Lo/Lpe;;, Cowan et al have given useful asymptotic 


>? Rven a test of the null hypothesis may not be completely independent of ideas about alternatives. 
Thus in an event counting experiment, new physics usually results in an increase in rate, unless 
we are looking for neutrino oscillations, in which case a decrease would be significant. Also, 
sometimes the statistic used for a goodness of fit test of Ho may be the likelihood ratio for Ho as 
compared with a specific alternative Hj. 
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formulae for f (t)[76]; here Lyes; is the value of the likelihood when the parameters 
in Hp are set at their best values. 

A small value of p indicates that the data are not very compatible with the theory 
(which may be because the detector's response or the background is poorly modeled, 
rather than the theory being wrong). 

Particle Physicists usually convert p into the number of standard deviations o 
of a Gaussian distribution, beyond which the one-sided tail area corresponds to p; 
statisticians refer to this as the z-score, but physicists call it significance. Thus 50 
corresponds to a p-value of 3 * 1077. This is done simply because it provides a 
number which is easier to remember, and not because Gaussians are relevant for 
every situation. 

Unfortunately, p-values are often misinterpreted as the probability of the theory 
being true, given the data. It sometimes helps colleagues clarify the difference 
between p(A|B) and p(B|A) by reminding them that the probability of being 
pregnant, given the fact that you are female, is considerably smaller than the 
probability of being female, given the fact that you are pregnant. Reference [77] 
contains a series of articles by statisticians on the use (and misuse) of p-values. 

Sometimes S/4/ B or S/A/(S + B) or the like (where S is the number of observed 
events above the estimated background B) is used as an approximate measure of 
significance. These approximations can be very poor, and their use is in general not 
recommended.^? 


15.73 CL; 


This is a technique[58] which is used for situations in which a discovery is not 
made, and instead various parameter values are excluded. For example the failure to 
observe SUSY particles can be converted into mass ranges which are excluded (at 
some confidence level). 

Figure 15.5 (again) illustrates the expected distributions for some suitably chosen 
statistic f under two different hypotheses: the null Ho in which there is only standard 
known physics, and H4 which also includes some specific new particle, such as 
a SUSY neutralino. In Fig. 15.5c, the new particle is produced prolifically, and 
an experimental observation of t should fall in one peak or the other, and easily 
distinguishes between the two hypotheses. In contrast, Fig. 15.5a corresponds to 
very weak production of the new particle and it is almost impossible to know 
whether the new particle is being produced or not. 


3For example, if selections to enhance signal with respect to background were optimised using 
S/4/ B, extremely hard cuts might be chosen, yielding expected numbers of events $ = 0.1 and 
B = 107°. This results in S /VB = 10, which sounds very good, but in fact this selection is 
disastrous. 
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The conventional method of claiming new particle production would be if the 
observed : fell well above the main peak of the Ho distribution; typically a po value 
corresponding to 50 would be required (see Sect. 15.7.7). In a similar way, new 
particle production would be excluded if t were below the main part of the Hj 
distribution. Typically a 95% exclusion region would be chosen (i.e. p; < 0.05), 
where p; is by convention the left-hand tail of the Hj distribution, as shown in 
Fig. 15.5b. 

The CLs method aims to provide protection against a downward fluctuation of 
t in Fig. 15.5a resulting in a claim of exclusion in a situation where the experiment 
has no sensitivity to the production of the new particle; this could happen in 596 of 
experiments. It achieves this by defining? 


CL; — pi/(1 — po). (15.17) 


and requiring C Ls to be below 0.05. From its definition, it is clear that CLs cannot 
be smaller than pı, and hence is a conservative version of the frequentist quantity 
pi. It tends to pj when f lies above the Ho distribution, and to unity when the Ho 
and H distributions are very similar. The reduced C Ls exclusion region is shown by 
the dotted diagonal line in Fig. 15.7; the price to pay for the protection provided by 
CL, is that there is built-in conservatism when p, is small but po has intermediate 
values i.e. there are more cases in which no decision is made. Most statisticians are 
appalled by the use of CL,, because they consider that it is meaningless to take the 
ratio of two p-values. 

It is deemed not to be necessary to protect against statistical fluctuations giving 
rise to discovery claims in situations with no sensitivity, because that should happen 
only at the 3 x 1077 rate (the one-sided 5o Gaussian tail area). 

Figure 15.7 is also useful for understanding the Punzi sensitivity definition (see 
Sect. 15.4.4). For any specified distributions of the statistic t for Hy and Hj, the 
possible (po, p1) values lie on a curve or straight line which extends from (0,1) to 
(1,0). With more data, the ¢ distributions separate, and the curve moves closer to the 
po and p; axes. The amount of data required to satisfy the Punzi requirement of 
always claiming a discovery or an exclusion is when no part of the curve is in the 
“no decision” region of Fig. 15.7. 


24Given the fact that CL, is the ratio of two p-values, the choice of symbol CL, (standing for 
‘confidence level of signal’) is not optimal. Another source of confusion is that in definitions of 
CL; the ways the p-values are defined vary, so the formulae can look different but the underlying 
concept is the same. 

A subtlety with Eq. (15.17) is that po there is the probability of obtaining a measurement 
greater than the observed one, rather than the usual ‘greater than or equal to’. This is to make 
1 — po the probability of a value smaller than or equal to the observed one, in analogy with the 
definition of pı. It makes a difference when the observation is a small discrete number. 
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Hypothesis Test: Gauss(iu,0) vs. Gauss(u,,c) 
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Fig. 15.7 Plot of po against p; for comparing a data statistic f with two hypotheses Ho and H1, 
whose expected pdf's for t are given by two Gaussians of peak separation Ay, and of equal 
width c. For a given pair of pdf’s for t, the allowed values of (po, pı) lie on a curve or straight 
line (shown solid in the diagram). The expected density for the data along a curve is such that its 
projection along the po-axis (or p;-axis) is expected to be uniform for the hypothesis Ho (or Hi 
respectively). As the separation increases, the curves approach the po and pı axes. Rejection of 
Ho is for po less than, say, 3 « 1077; here it is shown as 0.05 for ease of visualisation. Similarly 
exclusion of H is shown as p, < 0.1. Thus the (po, pı) square is divided into four regions: the 
largest rectangle is when there is no decision, the long one above the po-axis is for exclusion of H1, 
the high one beside the p,-axis is for rejection of Ho, and the smallest rectangle is when the data 
lie between the two pdf’s. For Au/o = 3.33, there are no values of (po, pı) in the “no decision” 
region. In the CL; procedure, rejection of Hı is when the t statistic is such that (po, pı) lies below 
the diagonal dotted straight line 


15.7.4 Comparing Two Hypotheses Via x? 


Assume that there is a histogram with 100 bins, and that a x? method is being used 
for fitting it with a function with one free parameter. The expected value of x? is 
99 + 14. Thus if po, the best value of the parameter, yields a x? of 85, this would be 
regarded as very satisfactory. However, a theoretical colleague has a model which 
predicts that the parameter should have a different value pı, and wants to know what 
the data have to say about that. This is tested by calculating the x? for that pı, which 
yields a value of 110. There appear to be two contradictory conclusions: 


e p; is satisfactory: This is based on the fact that the relevant x? of 110 is well 
within the expected range of 99 + 14. 
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* pı is ruled out: The uncertainty on p is estimated by seeing how much it must 
change from its optimum value in order to make x? increase by 1. For this data, 
X^(pi) is 25 units larger than x?(po), and so, assuming that the behaviour of 
X? in the neighbourhood of the minimum is parabolic, p; is ruled out at the ~5 
standard deviation level. 


Unfortunately, many physicists, over-impressed by the fact that x2(p1) appears 
to be satisfactory, are reluctant to accept that po is strongly favoured by the data. 

A similar argument applies to comparing a given set of data with 2 separate 
hypotheses e.g. fitting a histogram with an exponential or a straight line. Again the 
difference between the x? quantities provides better discrimination between the 
hypotheses than do the individual x? values[78]. Another example of using the 
difference in x?'s is given in the next section. 

There are of course other ways available for comparing two hypotheses. e.g. 
likelihood ratio, Bayes factor, Bayesian information criterion, etc. For a fuller 
discussion, see ref. [79]. A description of their application in cosmology can be 
found in ref. [80]. Problems in choosing priors for the Bayes factor approach for 
selecting among hypotheses are discussed by Heinrich[8 1]. 


15.7.5 Peak Above Smooth Background 


When comparing two hypotheses with our data, we can use the numerical values of 
the two x? quantities with a view to making some decision about the hypotheses. 
For example, we may be fitting a smooth distribution by a power series, and wonder 
whether we need a quadratic term, or whether a linear expression would suffice. 
Alternatively we may want to assess whether a mass spectrum favours the existence 
of a peak on top of a smooth background, as compared with just the smooth 
background. Qualitatively, if the extra term(s) are unnecessary, they will result in 
a relatively small reduction in x?, while if they really are required, the reduction 
could be larger. 

It is sometimes possible to be quantitative about the expected reduction when 
the extra terms are not needed[82]. If we are in the asymptotic regime, and if the 
hypotheses are nested,’ and if the extra parameters of the larger hypothesis are 
defined under the smaller one, and in that case do not lie on the boundary of their 
allowed region, then the difference in x? should itself be distributed as a ye with 
the number of degrees of freedom equal to the number of extra parameters. 

An example that satisfies this is provided by the different order polynomials. The 
hypotheses are nested, in that the linear situation is a special case of a quadratic, 
where the coefficient of the quadratic term is zero. Thus the extra parameter is 
defined and within the (infinite) allowed range. Then, provided we have a large 


25This means that for suitable values of the parameters the larger hypothesis reduces to the smaller 
one. 
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amount of data, we expect the difference in x? to have one degree of freedom, so a 
value larger than around 5 would be unlikely. 

A contrast is provided by a smooth background C(x) compared with a back- 
ground plus peak, C(x) + A exp [—0.5 * (x — xo)? /o?]. The extra parameters for 
the peak are its amplitude, position and width: A, xo and o respectively. Again the 
hypotheses are nested, in that C (x) is just a special case of the peak plus background, 
with A — 0. However, although A is defined in the background only case, xo and 
o are not, as their values become completely irrelevant when A = 0. Furthermore, 
unless the peak plus background fit allows A to be negative, zero is on the boundary 
of its allowed region. We thus should not expect the difference of the x? quantities 
itself to be distributed as a x? [83-85]. To assess the significance of a particular x! 
difference, this unfortunately means that we have to obtain its distribution ourselves, 
presumably by Monte Carlo. If we want to find out probabilities of statistical 
fluctuations at the 1076 level, this requires a lot of simulation, and probably needs 
us to use something better than brute force. 

The problem of non-standard limiting distributions for x? tests has a substantial 
statistical literature (see, for example, refs. [86] and [87].) 


15.7.6 Incorporating Nuisance Parameters 


The calculation of p-values is complicated in practice by the existence of nuisance 
parameters. (For the simple situation described in Sect. 15.7.2, there could be 
some uncertainty in the estimated background.) There are numerous ways of 
incorporating them. These include: 


* Conditioning: For example, with a single nuisance parameter, it may be possible 
to condition on the sum of the number of counts in the main and the subsidiary 
experiments, and then to use the binomial distribution to obtain the p-value. 

* Plug-in p-value: The best estimate of the nuisance parameter under the null 
hypothesis is used to calculate p. 

* Prior predictive p-value: The p-values are averaged over the nuisance parame- 
ters, weighted by their prior distributions. This is in the spirit of the Cousins and 
Highland approach[65] for upper limits. 

* Posterior predictive p-value: This time, the posterior distributions of the nuisance 
parameters are used for weighting. 

* Supremum p-value: The largest p-value for any possible value of the nuisance 
parameter is used. This is likely to be useful only when the nuisance parameter is 
forced to be within some range; or when there is only a small number of possible 
alternative theoretical interpretations. 

* Confidence interval: A region of frequentist confidence 1 — y is used for the 
nuisance parameter(s), and then the adjusted p-value is Pmax + y, where Pmax 
is the largest p-value as the nuisance parameters are varied over their confidence 
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region. Clearly if it is desired to establish a discovery from p-values around 1077 
or smaller, then y should be chosen at least an order of magnitude below this. 


The properties of these and other methods are compared by Demortier [84], while 
Cranmer [88] and Cousins et al.[89] have discussed some of them in the context of 
searches at the LHC. 

The role of systematic effects is likely to be more serious here than for upper 
limits discussed in Sect. 15.4.5. This is because in upper limit situations the number 
of events is usually small, and so statistical uncertainties dominate. In contrast, 
discovery claims have p-values of 3 x 1077 or smaller, and so tails of distributions 
are likely to be important. 


15.7.7 Why 5o? 


Unfortunately the usually accepted criterion for claiming a discovery in Particle 
Physics is that p should correspond to at least 50. Statisticians almost invariably 
ask why such a stringent level is used. One answer is past experience: all too often 
interesting effects at the 3o or 4c level have gone away as more data are collected. 
Another is the multiple comparison problem, or “Look Elsewhere Effect" (LEE). 
While the chance of obtaining a 5o effect in one bin of a particular histogram 
(‘local p-value”) is really small, it is to be remembered that histograms have many 
bins,"Ó they could be plotted with different selection criteria and different binning,” 
and there are very many other histograms that were or could have been looked at 
in the course of the experiment.?? Thus the chance of a 5o fluctuation occurring 
somewhere in the data (“global p-value") is much larger than might at first appear. 
Calculating a global p-value may require an excessive amount of Monte-Carlo 
simulation. Reference [90] circumvents this for asymptotic situations by providing a 
formula for extrapolating the LEE correction factor from a lower significance level; 
this requires considerably less simulation. 

Finally, physicists subconsciously incorporate Bayesian priors in assessing how 
likely they feel that they have discovered something new, and hence whether they 


?6Tn calculating a p-value in such a case, it is very desirable to take into account the number of 
chances for a statistical fluctuation to occur anywhere in the histogram (or anywhere in the search 
procedure, for more complicated analyses). At very least, it should be made clear what the basis of 
the calculated p-value is. 


?Tf a blind analysis is performed, such decisions are made before looking at the data, and so this 
aspect of the “look elsewhere" effect is reduced. 


28The extent to which other people's searches should be included in an allowance for the “look 
elsewhere" effect depends on the implied question being addressed. Thus are we considering the 
chance of obtaining a statistical fluctuation in any of the analyses we have performed; or by 
anyone analysing data in our experiment; or by any Particle Physicist this year? Because of the 
ambiguity of which specific question is being addressed, which is often not explicitly mentioned, 
we recommend not including an extra “look elsewhere" factor for this. 
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should claim a discovery. Thus, in deciding between the possibilities of a new 
discovery or of an undetected systematic effect, our priors might favour the latter, 
and hence strong evidence for discovery is required from the data.?? 

However it is not necessarily equitable to use a uniform standard for large 
general-purpose experiments and for small ones with a specific aim; or for looking 
for a process which is expected (e.g. H? — yt), as compared with a more 
speculative search, such as lepton substructure[91]. But physicists and especially 
journal editors seem to like a defined rule rather than a flexible criterion, so this 
bolsters the 5o0 standard. In any case, it is largely a semantic issue, in that physicists 
finding a 4.50 effect would clearly report it, using judiciously chosen wording to 
describe the interpretation of their observation. 

Statisticians also ask whether models can really be trusted to describe the extreme 
tails of distributions. In general, this may be so—counting experiments are expected 
to follow Poisson distributions, with small corrections for possible long time- 
scale drifts in detector calibrations; and particle decays usually are described by 
exponential distributions in time. However, the situation is much less clear for 
nuisance parameters, where uncertainty estimates may be less rigorous, and their 
distribution is often assumed to be Gaussian (or truncated Gaussian) by default. The 
effect of these uncertainties on very small p-values needs to be investigated case- 
by-case. 

It is important to remember that p-values merely test the null hypothesis. There 
are more sensitive ways of looking for new physics when a specific alternative is 
relevant. Thus a very small p-value on its own is usually not enough to make a 
convincing case for discovery. 


15.7.8  Repetitions in Time 


Often experiments accumulate data over several years. The same search for a new 
effect may typically be repeated once or twice each year as more data are collected. 
Does this constitute another factor of ~20 in the number of opportunities for a 
statistical fluctuation to appear? Our reply is “No”. If there had been a 66 signal 
with the early data (which resulted in a claim for discovery), which had then become 
only 3c with more data, this would be grounds for downplaying the earlier discovery 
claim. Thus at any time, there is essentially only one set of data (everything) that is 
relevant. 

For a p-value to be meaningful, it is important that the time at which the 
experiment stops collecting data is determined not by the significance of the 
observed signal but by external factors (e.g. accelerator being decommissioned, 
ending of funding, etc.). Indeed there is a theorem that states that, provided data is 


?9]f I were performing an experiment to look for violations of energy conservation, I would require 
more than 50, because my prior for energy being conserved is very large. 
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collected for long enough, it is possible to reach any arbitrary level of significance 
against a hypothesis that is in fact true. 


15.7.9 Combining p-Values 


In looking for a given new effect, there may be several separate and uncorrelated 
analyses which are relevant. These could correspond to different decay modes for 
the new particle; or different experiments looking for the same signal. Thus, if the 
p-values for the null hypothesis (i.e. no new physics) for the separate analyses were 
107 and 0.1, what is the corresponding p-value for the pair of results??? 

The unambiguous answer is that there is no unique recipe for combining them[92, 
93]. There is no single way of taking a uniform distribution in two variables, and 
finding a transformation Peomp(p1, p2) that converts it into a uniform distribution 
of the single variable Peomb- 

Two popular recipes involve asking what is the probability that the smaller p- 
value will be 1076 or smaller; or that the product is below pı * p2 = 1077. (Note 
that these probabilities are not 107 and 107 respectively.) None of the possible 
methods has the property that in combining three p-values, the same answer is 
obtained if p, is first combined with p», and then the result is combined with p3; or 
whether some different ordering is used. 

Another problem is the lack of other information that might be relevant. For 
example, the p-values might arise from x?'s with different numbers of degrees of 
freedom v e.g. x? = 90 for 100 degrees of freedom, and xà = 20 for v = 1. 
The second has a very small p-value, so many combination methods (including the 
two mentioned above) would conclude that overall the data do not look consistent 
with the null hypothesis. However, another plausible-sounding method is to add 
the separate x? values and also the individual v,?! to obtain a total x? = 110 for 
v = 101, which sounds perfectly satisfactory. The resolution of this discrepancy 
of interpretation depends on the nature of the two tests. If the second analysis with 
X? — 20 corresponded to just one extra measurement like the previous 100, then it 
seems reasonable to combine the x? values and the v, and to conclude that overall 
there is indeed nothing surprising. But on the other hand, if the second measurement 
was genuinely different, and an alternative way of looking for some discrepancy, 
then it may be more appropriate to combine the p-values by one of the earlier 
methods, which suggest that the overall consistency with theory is not good. It 


30Rather than combining p-values, it is of course much better to use the complete sets of original 
data (if available) for obtaining the combined result. 

31 The method described earlier involving the product of the p-values is equivalent to converting 
each p to a x?, assuming that v — 2, regardless of whether this was the actual number of degrees 
of freedom, and then adding the x? and also the v. 
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is this extra information about the nature of the two tests that determines which 
combination method might be appropriate. 

It is clearly important to decide in advance what combination method should be 
used, without reference to the specific data being analysed. 


15.8 Blind Analyses 


These are becoming increasingly popular as a means of avoiding personal bias 
affecting the result. They involve keeping part of the data unseen by the analysers, 
until the data selection procedure and the analysis method have been completely 
defined, all correction procedures specified, etc. 

One of the early suggestions to use a blind analysis in a Particle Physics 
experiment was due to Luis Alvarez. An experiment at Stanford had looked for 
quarks, by measuring the residual charge on small spheres that were levitated in 
a superconducting magnet. If a single free quark were present in a sphere, the 
residual charge would be a third or two-thirds of the electron’s charge. Several of 
the balls tested indeed yielded such values[94]. A potential problem was that large 
corrections had to be applied to the raw data in order to extract the final result for 
the charge. The suspicion was that maybe the experimenters were (subconsciously) 
applying corrections until the value turned out to be ‘satisfactory’. The blind 
approach involved the computer adding a random number to the raw value of the 
charge, which would then be corrected until the experimentalists were satisfied, and 
only then would the computer subtract the random number to reveal the final answer 
for that sphere.?? 

There are various methods of performing blind analyses[95] most of which aim 
to allow the experimentalists to look at some of the real data, in order to perform 
checks that nothing is terribly wrong. Some of these are: 


* The computer adds a random number to the data, which is only subtracted after 
all corrections are applied. This was the method suggested by Alvarez. 

* Useonly Monte Carlo to define the procedure. This completely avoids the danger 
of allowing the data to determine the procedure to be used, but suffers from the 
drawback that the data cannot be compared with the Monte Carlo, to check that 
the latter is reasonable. 

* Use only a fraction of the data for defining the procedure, which then is held 
fixed for the remainder of the data. In principle, an optimisation can be employed 
to determine the fraction to be kept open, but in practice this is often decided by 
choosing a semi-arbitrary time after which the future data is kept blind. 


32This suggestion was implemented, but in fact no subsequent results were published. The current 
consensus is that this ‘discovery’ of free quarks is probably spurious. 
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* The signal region is defined by a certain part of multi-dimensional space, and this 
is kept hidden, but all other regions, including those adjacent to the signal, are 
available for inspection. 

* Keep the Monte Carlo parameters hidden. This is a technique suggested by the 
TWIST experiment in their high statistics precision determination of parameters 
associated with muon decay. The procedure involves comparing the data with 
various simulated sets, generated with a series of different parameter values. 
The data and the simulations are both visible, but the parameter values used to 
generate the simulations are kept hidden. 

* Keep visible only a fraction of the contents of each bin of a histogram. This is 
used by the MINOS experiment searching for neutrino oscillations; these would 
affect the energy distribution of the observed events. By keeping visible different 
unknown fractions of the data in each bin, the energy spectral shape cannot be 
determined from the visible part of the data. 


If several different groups within the same collaboration are performing similar 
analyses for extracting some specific parameter, then it is desirable to fix the 
procedure for selecting which result to present, or alternatively how to combine 
the separate results. This should be done before the results are seen, and is worth 
doing even if the individual analyses were not "blind". 

A question that arises with blind analyses is whether it should be permitted to 
modify the analysis after the data had been unblinded. It is generally agreed that 
this should not be done, unless everyone would regard it as ridiculous not to do 
so. For example, if a search for rare events yielded 10 candidates over the course 
of a year's run, all of which occurred on Sunday mornings at precisely 1.17 a.m., 
it would be prudent to do some further investigation before publishing. If ‘post- 
unblinding' modification of the procedure is performed, this should be made clear 
in any publication. 


15.9 Topics that Deserve More Attention 


15.9.1 Statistical Software 


Particle physicists tend to write their own software for performing statistical compu- 
tations. Although this has educational merits, it is inefficient use of one's time. The 
data-manipulation system of programmes ROOT/RooFit/RooStats contains many 
useful statistical routines[96]. Tools also exist for implementing many methods for 
separating signal from background[43, 44]. 

A problem with these is that they are too easy to use. In the hands of a non- 
critical user, the required input data instructions may contain some error, with 
the consequence that they will produce the solution to a different procedure than 
the intended one. It is very important to check that the result obtained is not 
unreasonable. 
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15.9.2 Deep Learning 


This involves the use of sophisticated techniques[45] for achieving nearly optimal 
extraction of information from data, but which are still relatively unfamiliar to many 
scientists. It is important to develop a set of protocols to ensure that they perform in 
a reliable manner, and are not introducing subtle biases of which users are unaware. 


15.9.3 Unfolding Data or Smearing Theory? 


Observed experimental distributions are almost always smeared versions of ‘the true 
distributions of Nature'. It is simpler to compare theory and data by smearing the 
theory, rather than trying to unfold the experimental effects from the data, as the 
latter is a less stable procedure and also introduces correlations among the bins 
of the unfolded distribution. Some fields tend to favour deconvolution; this is partly 
because it is rarer for them to have a dominant theoretical model with which the data 
is to be compared. Unfolding does have the advantage that it provides an estimate 
of the *true' distribution, with which any future theory can be compared. Also it can 
be looked at by a physicist, but we are not accustomed to readily interpreting data 
where the contents of the histogram bins are highly correlated. 

There are some situations where unfolding is desirable. For example, it allows the 
comparison of distributions from different experiments, with different resolutions. 
Another is using experimental data for tuning Monte Carlo generators; smearing the 
data at each step of the optimisation increases the computation time too much. 

Even for checking in future whether new theories are compatible with data does 
not necessarily require unfolding. Provided that the smearing matrix of the detector 
is provided, the future data can be smeared, and then compared with the actual (not 
unfolded) data. However, including the effects of systematics can be a complication. 

Sessions at the 2011 PHYSTAT workshop[9] and at CERN's PHYSTATv 
meeting[12] were devoted to unfolding. Blobel[97] has reviewed the topic, while 
ref. [98] contains a statistician's view of the statistical issues involved in unfolding. 


15.9.4 Visualisation 


The combination of the human eye and brain is very powerful at detecting patterns 
in data (even if sometimes they are not there!) This can be useful in deciding how 
to analyse the data; as a check on whether the result of an analysis is plausible; 
whether a machine learning method for separating signal from background is 
performing sensibly; etc. Such human inspection of data is feasible if there are only 
a small number (below 4) of relevant variables. Techniques for inspecting multi- 
dimensional data would be valuable. 
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15.9.5 Non-parametric Methods 


These are so unknown to most Particle Physicists that they are usually unaware 
when they are using them. Simple examples include: 


* A histogramme as an estimate of the density distribution of a variable of interest. 

* Kernel density estimation. 

* Kolmogorov-Smirnov or Anderson-Darling methods, to test whether distribu- 
tions are consistent. 

* Classification schemes based on k nearest neighbours. 

* Neural networks 


These all avoid the need to specify a particular parametric form, and hence 
the values of any parameters. In general such a method is less powerful than a 
parametric one, if the latter were available and relevant. 


15.9.6 Collaboration with Statisticians 


Other scientists seem to be better than particle physicists about involving statisti- 
cians in the analysis of their data. This is partly due to the fact that we like to try 
out statistical techniques ourselves; that we consider our data is too complicated for 
other people to deal with; and that we are somewhat over-protective of our data, 
and are reluctant to share it with others. None of this is particularly convincing, and 
it is clear that we would benefit from the involvement of professional statisticians. 
The advantages of having them participating in the recent PHYSTAT meetings have 
been obvious. 

In the past, Particle Physicists have on occasion asked rather specific questions 
to Statisticians they happened to know. Statisticians prefer to be much more directly 
involved with the data itself. With analyses becoming more and more complex, it 
will be highly desirable for them to be affiliated with experimental groups. 


15.10 Conclusion 


Although the statistical aspects of many particle physics analyses are already at a 
sophisticated level, it is clear that there are many practical statistical issues to be 
resolved. With the increasing complexity of scientific investigations, more active 
collaboration with statisticians and machine learning experts will result in a better 
understanding of the relevant techniques and improved analyses in the future. 
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Chapter 16 A 
Integration of Detectors into a Large gsti 
Experiment: Examples from ATLAS 

and CMS 


Daniel Froidevaux 


16.1 Introduction 


16.1.1 The Context 


The Large Hadron Collider (LHC) is the proton-proton accelerator which began 
operation in 2010 in the existing LEP tunnel at CERN in Geneva, Switzerland. 
It represents the next major step in the high-energy frontier beyond the Fermilab 
Tevatron (proton-antiproton collisions at a centre-of-mass energy of 2 TeV), with 
its design centre-of-mass energy of 14 TeV and luminosity of 10°4cm7?s~!. The 
high design luminosity is required because of the small cross-sections expected 
for many of the benchmark processes (Higgs-boson production and decay, new 
physics scenarios such as supersymmetry, extra dimensions, etc.) used to optimise 
the design of the general-purpose detectors over a period of 15 years or so. To 
achieve this luminosity and minimise the impact of simultaneous inelastic collisions 
occurring at the same time in the detectors (a phenomenon usually called pileup), 
the LHC beam crossings are 25 ns apart in time, resulting in 23 inelastic interactions 
per crossing on average at design luminosity. Two general-purpose experiments, 
ATLAS and CMS, were proposed for operation at the LHC in 1994 [1], and 
approved for construction in 1995. The experimental challenges undertaken by 
these two projects of unprecedented size and complexity in the field of high-energy 
physics, the construction and integration achievements realised over the years 2000— 
2008, and the expected performance of the commissioned detectors are described in 
a variety of detailed documents, such as the detector papers [2, 3]. In this chapter, 
much of the description of the lessons learned based on this huge effort, and of 
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the comparisons in terms of expected performance have been taken and somewhat 
updated from a recent review [4]. For completeness, it is important to mention 
also the two more specialised and smaller experiments, ALICE [5] and LHCb [6]. 
In 2019, at a moment when the accelerator and experiments have just completed 
very successfully the so-called run-2 with 4 years of operation at a centre-of-mass 
energy of 13 TeV, and after run-1 with operation at lower energies topped with the 
discovery of the Higgs boson, it is interesting to look back not only on the period of 
construction and integration with its great expectations, a period which is the main 
focus of this chapter, but also on almost 10 years of operation and data-taking with 
its own challenges and of course with the excitement stemming from the analysis of 
real data. 

The prime motivation of the LHC is to elucidate the nature of electroweak 
symmetry breaking, for which the Higgs mechanism is presumed to be responsible. 
The experimental study of the Higgs mechanism can also shed light on the 
consistency of the Standard Model at energy scales above | TeV. The Higgs boson is 
generally expected to have a mass below about 200 GeV [7]. This expectation could 
be relaxed if there are problems in the interpretation of the precision electroweak 
data [8] or if there are additional contributions to the electroweak observables [9]. A 
variety of models without Higgs bosons have also been proposed more recently, 
together with mechanisms of partial unitarity restoration in longitudinal vector 
boson scattering at the TeV scale [10]. All these possibilities may appear to be 
remote, but they serve as a reminder that the existence of a light Higgs boson cannot 
be taken for granted. 

Theories or models beyond the Standard Model invoke additional symmetries 
(supersymmetry) or new forces or constituents (strongly-broken electroweak sym- 
metry, technicolour). It is generally hoped that discoveries at the LHC could provide 
insight into a unified theory of all fundamental interactions, for example in the form 
of supersymmetry or of extra dimensions, the latter requiring modification of gravity 
at the TeV scale. There are therefore several compelling reasons for exploring the 
TeV scale and the search for supersymmetry is perhaps the most attractive one, 
particularly since preserving the naturalness of the electroweak mass scale requires 
supersymmetric particles with masses below about | TeV. 


16.1.2 The Main Initial Physics Goals of ATLAS and CMS at 
the LHC 


There have been many studies of the LHC discovery potential as a function of the 
integrated luminosity and the ones released just before data-taking [11, 12] have 
focussed on the first few years, over which about 10 fb~! of integrated luminosity 
were expected to be accumulated by each experiment. 

With some optimism that the performance of the ATLAS and CMS detectors 
would be understood rapidly and would be close to expectations, the expectations 
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Fig. 16.1 Integrated 
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at the time were that a Standard Model Higgs boson could be discovered at the 
LHC with a significance above 5o over the full mass range of interest and for an 
integrated luminosity of only 5 fb~!, as shown in Fig. 16.1. This discovery potential 
should, however, be taken with a grain of salt, since the evidence for a light Higgs 
boson of mass in the 110-130 GeV range would not only have to be combined 
over both experiments but also over several channels with very different final states 
(H — yy decays in association with various jet topologies, tt H production with 
H — bb decay and qq H production with H — tt decay). Achieving the required 
sensitivity in each of these channels would require an excellent understanding of 
the detailed performance of most elements of these complex detectors and would 
therefore require sufficient experimental data and time. 

The discovery potential for supersymmetry was expected to be very substan- 
tial in the very first months of data-taking, since only 100 pb^! of integrated- 
]uminosity would be sufficient to discover squarks or gluinos with masses below 
about 1.3 TeV [1, 11, 13], a large increase in sensitivity with respect to that 
ultimately achieved at the Tevatron. This sensitivity would increase to 1.7 TeV 
for an integrated luminosity of 1 fb^! and to about 2.2 TeV for 10fb~!, as shown 
in Fig. 16.2. 

The few examples above illustrate the wide range of physics opened up by the 
seven-fold increase in energy from the Tevatron to the LHC. Needless to say, all 
Standard Model processes of interest, QCD jets, vector bosons and especially top 
quarks, would be produced in unprecedented abundance at the LHC, as illustrated 
in Table 16.1, and would therefore be studied with high precision by ATLAS and 
CMS. 
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Table 16.1 For a variety of physics processes expected to be the most abundantly produced at the 


LHC, expected numbers of events recorded by ATLAS and CMS for an integrated luminosity of 
1 fb—! per experiment 


Physics process Number of events per 1 fb! 

QCD jets with Er > 150GeV 10° (for 10% of trigger bandwidth) 
W — uv 7.0 - 106 

Zu 1.1 - 10° 

tt > e/u+X 1.6-10° 

Gluino-gluino production (mass about 10? ...10? 

1 TeV) 


16.1.3 A Snapshot of the Current Status of the ATLAS 
and CMS Experiments 


From the year 2000 to end of 2009, the experiments have had to deal in parallel with 
a very complex set of tasks requiring a wide diversity of skills and personnel: 


* the construction of the major components of the detectors was complete or 
nearing completion at the end of 2006, after a very long period of research 
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Fig. 16.3 Left: picture of the ATLAS barrel toroid superconducting magnet with its eight coils of 
25 m length and of the ATLAS barrel calorimeter with its liquid Argon electromagnetic calorimeter 
and its scintillating tile hadronic calorimeter, as installed in the experimental cavern. Right: picture 
of the first end-cap LAr cryostat, including the electromagnetic, hadronic and forward calorimeters, 
as it is lowered into its docking position on one side of the ATLAS pit 


Fig. 16.4 Left: picture of the CMS superconducting solenoid, as integrated with the barrel muon 
system (outside) and with the barrel hadron calorimetry (inside). Right: picture of the insertion of 
the CMS silicon-strip tracker into the barrel crystal calorimeter 


and development, including validation in terms of survival to irradiation and 
preparation of industrial manufacturing; 

* the integration and installation phase began approximately in 2003 and extended 
all the way to 2007 for the last major components. ATLAS was being installed 
and commissioned directly in its underground cavern (see Fig. 16.3). In contrast, 
CMS is modular enough that it could be assembled above ground (see Fig. 16.4). 

* the commissioning of the experiments with cosmic rays began in 2006, with 
the biggest campaigns in 2008 and 2009. These have yielded a wealth of initial 
results on the performance of the detectors in situ, a very important asset to ensure 
a rapid commissioning of the detectors for physics with collisions; 

* the next commissioning step was achieved in an atmosphere of great excitement 
with first collisions at the injection energy of 900 GeV of the LHC machine 
and with very low luminosities of the order of 1026— 10? cm~*s~!. All detector 
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components were able to record significant samples of data, albeit at low energy 
and with insufficient statistics to fully commission the trigger and reconstruction 
algorithms dedicated to provide the signatures required for the initial Standard 
Model measurements and searches for new physics. 


In parallel with the rapidly evolving integration, installation and commissioning 
effort at the experimental sites, the collaborations have also reorganised themselves 
to evolve as smoothly and efficiently as possible from a distributed construction 
project with a strong technical co-ordination team to a running experiment with the 
emphasis shifting to monitoring of the detector and trigger operation, understanding 
of the detector performance in the real LHC environment and producing the first 
physics results. A small but significant part of the human and financial resources are 
already focusing on the necessary upgrades to the experiments required by the LHC 
luminosity upgrade programme. 

This chapter has been structured in the following way: Sect. 16.2 presents 
an overview of the ATLAS and CMS projects in terms of their main design 
characteristics, describes briefly the magnet systems, and summarises the main 
lessons learned from the 15-year long research and development and construction 
period. The next three sections, Sects. 16.3—16.5, describe in more detail the main 
features and challenges related respectively to the inner tracker, to the calorimetry 
and to the muon spectrometer, in the specific case of the ATLAS experiment. The 
subsequent two sections, Sects. 16.6 and 16.7, discuss in broad terms the various 
aspects of, respectively, the trigger and data acquisition system and the computing 
and software, again in the context of the ATLAS experiment. The next section, 
Sect. 16.8, summarises and compares briefly the expected performances at the time 
of beginning of data-taking of the main ATLAS and CMS systems. The last and 
final section, Sect. 16.9, gives a very brief overview of the performance and physics 
results achieved over the past 10 years. 


16.2 Overall Detector Concept and Magnet Systems 


This section presents an overview of the ATLAS and CMS detectors, based on 
the main physics arguments which guided the conceptual design, and describes 
the magnet systems, which have driven many of the detailed design aspects of the 
experiments. 


16.2.1 Overall Detector Concept 


Figures 16.5 and 16.6 show the overall layouts respectively of the ATLAS and 
CMS detectors and Table 16.2 lists the main parameters of each experiment. Both 
experiments are designed somewhat as cylindrical onions consisting of: 
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Fig. 16.6 Overall layout of the CMS detector 
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Table 16.2 Main design Parameter ATLAS |CMS 
A BT Fal weight [Fons] 7000 12,500 
Overall diameter [m] 22 15 
Overall length [m] 46 20 
Magnetic field for tracking [T] 2 4 
Solid angle for precision 2n x 5.0 | 27 x 5.0 
measurements (Ag x An) 
Solid angle for energy 2m x 9.6 |2z x 9.6 
measurements (Ag x An) 
Total cost (MCHF) 550 550 


* aninnermost layer devoted to the inner trackers, bathed in a solenoidal magnetic 
field and measuring the directions and momenta of all possible charged particles 
emerging from the interaction vertex; 

* an intermediate layer consisting of electromagnetic and hadronic calorimeters 
absorbing and measuring the energies of electrons, photons and hadrons; 

e an outer layer dedicated to the measurement of the directions and momenta of 
high-energy muons escaping from the calorimeters. 


To complete the coverage of the central part of the experiments (often called barrel), 
so-called end-cap detectors (calorimetry and muon spectrometers) are added on each 
side of the barrel cylinders. 

The sizes of ATLAS and CMS are determined mainly by the fact that they 
are designed to identify most of the very energetic particles emerging from the 
proton-proton collisions and to measure as efficiently and precisely as feasible 
their trajectories and momenta. The interesting particles are produced over a very 
wide range of energies (from a few hundred MeV to a few TeV) and over the full 
solid angle. They need therefore to be detected down to very small polar angles 
(0) with respect to the incoming beams (a fraction of a degree, corresponding 
to pseudorapidities 7 of up to 5, where y = —log[tan(@/2)]; pseudorapidity is 
more commonly used at hadron colliders because the rates for most hard-scattering 
processes of interest are constant as a function of 7). Most of the energy of the 
colliding protons is however dissipated in shielding and collimators close to the 
focussing quadrupoles (on each side of the experimental caverns, which house 
the experiments). The overall radiation levels will therefore be very high: many 
components in the detectors will become activated and will require special handling 
during maintenance, particularly near the beams. 

For all the above reasons, both experiments have been designed following similar 
guiding principles: 


* No particle of interest should escape unseen (except neutrinos, which will 
therefore be identified because their presence will cause an imbalance in the 
energy-momentum conservation laws governing the interactions measured in the 
experiments). The consequences of this simple statement are profound and far- 
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reaching when one goes beyond simple sketches and simulations to the details of 
the real experiment: 


— successful operation of detectors able to measure the energies of particles with 
polar angles as small as one degree with respect to the incoming beams has 
required quite some inventiveness in material technology and a lot of detailed 
validation work to qualify the so-called forward calorimeters in terms of the 
very large radiation doses and particle densities encountered so close to the 
beams. Similar issues have been addressed of course very early on for the 
trackers, the main concerns being damage to semi-conductors (sensors and 
integrated circuits) and ageing of gaseous detectors. Even the muon detectors, 
to the initial surprise of the community, were confronted with irradiation and 
high-occupancy issues from neutron-induced cavern backgrounds pervading 
the whole experimental area; 

— avoiding any cracks in the acceptance of the experiment (especially cracks 
pointing back to the interaction region) has been a challenge of its own in 
terms of minimising the thickness of the LAr cryostats in ATLAS and of 
properly routing the large number of cables required to operate the ATLAS 
and CMS inner trackers; 

— if no particle can escape from the large volumes occupied by the experiments, 
then it becomes very hard for human beings to enter for rapid maintenance and 
repair. The access and maintenance scenarios for both experiments are quite 
complex and any major operation will only be feasible during long shutdowns 
of the accelerators. The detector design criteria have therefore become close 
to those required for space applications in terms of robustness and reliability 
of all the components. 


* The high particle fluxes and harsh radiation conditions prevailing in the experi- 
mental areas have forced the collaborations to foresee redundancy and robustness 
for the measurements considered to be most critical. A few of the most prominent 
examples are described below: 


— CMS has chosen the highest possible magnetic field (4 T) combined with 
an inner tracker consisting sole